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Abstract. We introduce the notions of tree-like path and tree-Uke equivalence 
between paths and prove that the latter is an equivalence relation for paths of 
finite length. We show that the equivalence classes form a group with some 
similarity to a free group, and that in each class there is one special tree reduced 
path. The set of these paths is the Reduced Path Group. It is a continuous 
analogue to the group of reduced words. The signature of the path is a power 
series whose coefficients are definite iterated integrals of the path. We identify 
the paths with trivial signature as the tree-like paths, and prove that two 
paths are in tree-like equivalence if and only if they have the same signature. 
In this way, we extend Chen's theorems on the uniqueness of the sequence of 
iterated integrals associated with a piecewise regular path to finite length paths 
and identify the appropriate extended meaning for reparameterisation in the 
general setting. It is suggestive to think of this result as a non-commutative 
analogue of the result that integrable functions on the circle are determined, 
up to Lebesgue null sets, by their Fourier coefficients. As a second theme 
we give quantitative versions of Chen's theorem in the case of lattice paths 
and paths with continuous derivative, and as a corollary derive results on the 
triviality of exponential products in the tensor algebra. 



1. Introduction 

1.1. Paths with finite length. Paths, that is to say (right) continuous functions 
7 mapping a non-empty interval J C K into a topological space V, are fundamen- 
tal objects in many areas of mathematics, and capture the concept of an ordered 
evolution of events. 

If {V,dv) is a metric space, then one of 7's most basic properties is its length 
|7|j. This can be defined as 



7lj sup V dv (7t,_i,7tJ 



where the supremum is taken over all finite partitions V = {to < ti < ■ ■ ■ < tr} of 
the interval J. It is clear that I7I is positive (although possibly infinite) and indepen- 
dent of the parameterisation for 7. Letting T (t) = |7|[of] and setting (r (i)) = 7 (t) 
one sees that any continuous path of finite length can always be parameterised to 
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have unit speed. Paths of finite length are often said to be those of bounded or 
finite variation. 

Definition 1.1. We denote the set of paths of bounded variation by BV, BV-pa,ths 
with values in V by BV{V), and those defined on J by BV (J, V). 

If y is a vector space, then for any 7 G BV ([0, t],V) and t e BV ([0, s],V) we 
can form the concatenation 7 * t S BV {[0,s + t] , V) 

7 * r (u) — J (u) , u £ [0, s] 

7 * r (u) = r (u — s) + 7 (s) — r (0) , u £ [s,s + t] . 
The operation * is associative, and if Vis a normed space, then I7I + |r| — I7 * r|. 

1.2. Differential Equations. One reason for looking at BV{V) is that one can 
do calculus with these paths, while at the same time the set of paths with I7I j < I 
is closed under the topology of pointwise convergence (uniform convergence, . . . ). 
Differential equations allow one to express relationship between paths in BV. If 
are Lipschitz vector fields on a space W and 7^ = (7^^ (f) , . . . , 7^ (<)) G BV (R'^) 
then the differential equation 

i 

yo = a. 

has a unique solution for each 7. BV is a natural class here for, unless the vector 
fields commute, there is no meaningful way to make sense of this equation if the 
path 7 is only assumed to be continuous. 

If (y, 7) solves the differential equation and {y, 7) are simultaneous reparameter- 
isations, then they also solve the equation and so it is customary to drop the dt 
and write 

i i 

We can regard the location of j/s as a variable and consider the diffeomorphism Tr^t 
defined by TTst (?/s) := yt- Then Tr^f is a function of 7|[s^t] . One observes that the map 
l\[s,t] T^st is a homomorphism from (BV (V) , *) to the group of diffeomorphisms 
of the space W. 

1.3. Iterated integrals and the signature of a path. One could ask which are 
the key features of 7|[s,t]7 which, with y^, accurately predict the value yt in equation 
(jl.ip . The answer to this question can be found in a map from BV into the free 
tensor algebra! 

Definition 1.2. Let 7 be a path of bounded variation on [S,T] with values in a 
vector space V . Then its signature is the sequence of definite iterated integrals 

^S,T = (1 + Xg rp + . . . + Xg rp + . . .) 



= ( 1 + / d7„ + . . . + / d7„^ ^...^ d-f„^ 

Js<u<T J S<ui<...<Uk<T 



regarded as an element of an appropriate closure of the tensor algebra T(V) 
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The signature is the definite integral over the fixed interval where 7 is defined; 
re-parameterising 7 does not change its signature. The first term X^g produces 
the path 7 (up to an additive constant). For convenience of notation, when we 
have many paths, we will sometimes use a symbol such as Yt (instead of 7^) for our 
path, Yg J, for the z-th coordinate of the signature of Yt, and Ys.t for the signature 
of the path. In some circumstances we will drop the time interval and just write 
Y for the path and Y for its signature. We call this map the signature map and 
sometimes denote it by S" : X — > 5 {X) when this helps our presentation. 

The signature of X is a natural object to study. The map X — > X is a homomor- 
phism (c.f. Chen's identity [7]) from the monoid of paths with concatenation to (a 
group embedded in) the algebra T {V). The signature X(= Xo,t) can be computed 
by solving the differential equation 

d'K.Q^u = '^o.u ® dXu 
Xo,o = (1,0,0,...), 

and, in particular, paths with different signatures will have different effects for some 
choice of differential equation. 

There is a converse, although this is a consequence of our main theorem. If X 
controls a system through a differential equation 

dY^ = f(Y^)dXu, 
Yq = a, 

and / is Lipschitz, then the state Yt of the system after the application of X|[q j^j is 
completely determined by the signature Xq.t and Yq. In other words the signature 
X5.T is a truly fundamental representation for the bounded variation path defined 
on [S, T] that captures its effect on any non- linear system. 

This paper explores the relationship between a path and its signature. We deter- 
mine a precise geometric relation ^ on bounded variation paths, we prove that two 
paths of finite length are ^-equivalent if and only if they have the same signature: 

X\j - Y\k ^ Xj^Yk 

and hence prove that is an equivalence relation and identify the sense in which 
the signature of a path determines the path. 

The first detailed studies of the iterated integrals of paths are due to K. T. Chen. 
In fact Chen [2] proves the following theorems which are clear precursors to our 
own results: 

Chen Theorem 1: Let dj^, • • • , d'f^ be the canonical 1-forms on M''. If a, /3 G 
[a, b] R'^ are irreducible piecewise regular continuous paths, then the iterated 
integrals of the vector valued paths '^'^ ^^'-^ -fpio) Sigiee if and only if there 
exists a translation T of R'^, and a continuous increasing change of parameter 
A : [a, b] — + [a, b] such that a = Tf3X. 

Chen Theorem 2: Let G be a Lie group of dimension d, and let oji - ■ - Wd 
be a basis for the left invariant 1-forms on G. If a, G [a, b] ^ G are irreducible 
piecewise regular continuous paths, then the iterated integrals of the vector valued 
paths j^lo) ^^"^ -fpio) agree if and only if there exists a translation T of G, and 
a continuous increasing change of parameter A : [a, b] —> [a, b] such that a = 



We borrow these formulations from the Math Review of the paper but include the precise 
smoothness assumptions. 
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In particular, Chen characterised piecewise regular paths in terms of their sig- 
natures. 

1.4. The main results. There are two essentially independent goals in this paper. 

(1) To provide quantitative versions of some of Chen's results. If 7 is con- 
tinuous, of bounded variation and parameterised at unit speed, then we 
will obtain lower bounds on the coefficients in the signature in terms of 
the modulus of continuity of 7 and the length of the path. For example 
Theorem [5] shows how one can recover the length of a path 7 using the 
asymptotic magnitudes of these coefficients (c.f. Tauberian theorems in 
Fourier Analysis). A detailed discussion is to be found in Sections 2 and 3. 

(2) To prove a uniqueness theorem characterising paths of bounded variation 
in terms of their signatures (c.f. the characterisation of integrable functions 
in terms of their Fourier series) extending Chen's theorem to the bounded 
variation setting. 

For this second goal we need a notion of tree-like path, our definition codes R- 
trees by positive continuous functions on the line, as developed, for instance, in 
0. 

Definition 1.3. Xt, t Q [0, T] is a tree-like path in V if there exists a positive real 
valued continuous function h defined on [0,T] such that h{Q) — h (T) = and such 
that 

\\Xt~ XsWv <h{s) + h{t) -2 inf h{u). 

ue[s,t] 

The function h will be called a height function for X. We say X is a Lipschitz 
tree-like path if h can be chosen to be of bounded variation. 

Definition 1.4. Let X,Y £ BV{V). We say X ^ Y ii the concatenation of X and 
Y 'run backwards' is a Lipschitz tree-like path. 

We now focus on M'' and state our main results. 

Theorem 1. Let X e BV{R'^). The path X is tree-like if and only if the signature 
ofX isO = (1,0,0,...). 

As the map X — > X is a homomorphism, and running a path backwards gives 
the inverse for the signature in T (V), an immediate consequence of Theorem [1] is 

Corollary 1.5. If X,Y E BV{R'^), then X = Y i/ and only if the concatenation 
of X and Y run backwards' is a Lipschitz tree-like path. 

Corollary 1.6. For X,Y E BV{M.'^) the relation X ^ Y is an equivalence relation. 
Concatenation respects ^ and the equivalence classes S form a group under this 
operation. 

There is an analogy between the space of paths of finite length in M"^ and the 
space of words a^^b^^ . . . c^^ where the letters a,b, . . . ,c are drawn from a d-letter 
alphabet A. Every such word has a unique reduced form. This reduction respects 
the concatenation operation and projects the space of words onto the free group. 
We extend this result from paths on the integer lattice (words) to the bounded 
variation case. 

Corollary 1.7. For any X E BV{M.'^) there exists a unique path of minimal length, 
X, called the reduced path, with the same signature X = X. 
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Taking these results together we see that the reduced paths form a group. The 
muhiphcation operation is to concatenate the paths and then reduce the resuh. One 
should note that this reduction process is not unique (although we have proved that 
the reduced word one ultimately gets is). This group is at the same time rather 
natural and concrete (a collection of paths of finite length) , but also very different to 
the usual finite dimensional Lie groups. It admits more than one natural topology, 
and multiplication is not continuous for the topology of bounded variation. 

We can restate these results in different language. The space BV with *, the 
operation of concatenation, is a monoid. Let T be the set of tree-like paths in BV. 
Then T is also closed under concatenation. If 7 e BV and we use the notation 7"^ 
for 7 run backwards. It is clear from the definition that •y^^Tj C T for all 7 G BV. 
As we have proved that tree-like equivalence is an equivalence relation BV/T is 
well defined, closed under multiplication, and has inverses; it is a group. 

We have the following picture 

^ r ^ S ^ 

where one can regard E as the ^-equivalence classes of paths or as the subgroup 
of the tensor algebra. The map <--- takes the class to the reduce path which is an 
element of BV . As T has no natural i?T^-normal sub-monoids, one should expect 
that any continuous homomorphism of BV into a group will factor through E if it 
is trivial on the tree-like elements. It is clear that the set 

T = {(7, /i) , 7 e T, ha, height function for 7} 

is contractable. An interesting question is whether T itself is contractable. 

We prove in Lemma 16.31 that any 7 g T is the limit of weakly piecewise linear 
tree-like paths and hence T is the smallest multiplicatively closed and topologically 
closed set containing the trivial path. This universality suggests that S has similar- 
ities to the Free group. One characterising property of the free group is that every 
function from the alphabet A into a group can be extended to a map from words 
made from A into paths in the group. The equivalent map for bounded variation 
paths is Cartan development. Let he a. linear map of to the Lie algebra g of 
a Lie group G and let Xt\t<T be a bounded variation path. Then Cartan develop- 
ment provides a canonical projection of 6 [X) to a path Y starting at the origin in 
G and we can define 9 : X ^Yt. This map is a homomorphism from E to G. 

It is an exercise to prove that this map takes all tree-like paths to the identity 
element in the group G. As a consequence, is a map from paths of finite variation 
to G which is constant on each ^ equivalence class and so defines a map from E to 
G. 

Let Xt\t<T be a path of bounded variation in and suppose that for every 
linear map into a Lie algebra g, that {X) is trivial. As the computation of the 
first n terms in the signature is itself a development (into the free rt-step nilpotent 
group) we conclude that Xo,t — (1, 0,0,.. .) and so X is tree-like. In this way we 
have a 

Corollary 1.8. A path of hounded variation is tree-like if and only if its develop- 
ment into every finite dimensional Lie group is trivial. 

The observation that any linear map of M'^ to the Lie algebra g defines a map 
from E to the Lie group is a universal property of a kind giving further evidence 
that E is some sort of continuous analogue of the free group. However, E is not a 
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Lie group although it has a Lie algebra and it is not characterised by this property. 
(Chen's piecewise regular paths provide another example since they are paths of 
bounded variation and are dense in the unit speed paths of finite length). 

1.5. Questions and Remarks. How important to these results is the condition 
that the paths have finite length? Does anything survive if one only insists that 
the paths are continuous? 

The space of continuous paths with the uniform topology is another natural 
generalisation of words - certainly concatenation makes them a monoid. However, 
despite their popularity in homotopy theory, there seems little hope that a natural 
closed equivalence relation could be found on this space that transforms it into a 
continuous 'free group' in the sense we mapped out above. The notion of tree-like 
makes good sense (one simply drops the assumption that the height function h is 
Lipschitz). With this relaxation, 

Problem 1.9. Does ^ define an equivalence relation on continuous paths? 

Homotopy is the correct deformation of paths if one wants to preserve the line 
integral of a path against a closed one-form. On the other hand tree-like equivalence 
is the correct deformation of paths if one wants to preserve the line integral of a 
path against any one form. As we mention elsewhere in this paper, integration of 
continuous functions against general one forms makes little sense. This is perhaps 
evidence to suggest the answer to the problem is in the negative. The problem is 
in the transitivity of the relation. 

Problem 1.10. Is there a unique tree reduced path associated to any continuous 
path? 

For smooth paths 7 — (71,72) hi Cartan development into the Heisenberg 
group is the map (71,72) (7n72'/ 7i^72) ■ ^'^'^ knows [21 Proposition 1.29] 
that there is no continuous bilinear map extending this definition to any Banach 
space of paths which carries the Wiener measure. We also know from Levy, that 
there are many "almost sure" constructions for this integral made in similar ways 
to "Levy area" . All are highly discontinuous and can give different answers for the 
same Brownian path in . This wide choice for the case of Brownian paths (which 
have finite variation for every p > 2) makes it clear there cannot be a canonical 
development for all continuous paths. 

The paper [7j sets out a close relationship between differential equations, the 
signature, and the notion of a geometric rough path. These "paths" also form a 
monoid under concatenation and any linear map from R*^ into the [p + £)-Lipschitz 
vector fields on a manifold M induces a canonical homomorphism of the p-rough 
paths with concatenation into the group of diffeomorphisms of M so they certainly 
have the analogy to the Cartan development property. Similarly, every rough path 
has a signature, and the map is a homomorphism. 

Problem 1.11. Given a path 7 of finite p-variation for some p > 1, is the triviality 
of the signature of j equivalent to the path being tree-like? 

Our theorem establishes this in the context of p = 1 or bounded variation paths 
but our proof uses the one dimensionality of the image of the path in an essential 
way. An extension to p-rough paths with p > 1 would require new ideas to account 
for the fact that these rougher paths are of higher "dimension" . 
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There seem to be many other natural questions. 

By Corollary 11.71 among paths of finite length with the same signature there is 
a unique shortest one - the reduced path. Successful resolution of the following 
question could have wide ramifications in numerical analysis and beyond. The 
question is interesting even for lattice paths. 

Problem 1.12. How does one effectively reconstruct the reduced path from its 
signature ? 

A related question is to: 

Problem 1.13. Identify those elements of the tensor algebra that are signatures 
of paths and relate properties of the paths (for example their smoothness) to the 
behaviour of the coefficients in the signature. 

Some interesting progress in this direction can be found in [3]. 
We conclude with some wider comments. 

(1) There is an obvious link between these reduced paths and geometry since 
each connection defines a closed subgroup of the group of reduced paths 
(the paths whose developments are loops). 

(2) It also seems reasonable to ask about the extent to which the intrinsic struc- 
ture of the space of reduced paths (with finite length) in d > 2 dimensions 
changes as d varies. 

1.6. Outline. We begin in Section 2 by discussing the lattice case. In this setting 
we can obtain our first quantitative result on the signature. We do not have best 
possible estimates, but we can prove that a word in the free group of length L in 
two generators is completely reducible if the first [elog (l + v^) LJ terms in the 
signature of the path in the lattice corresponding to the word are zero. The case 
of words in d generators is also treated and if the first c{d)L terms in the signature 
are zero, the word is reducible, where the constant c{d) grows logarithmically in d. 

In Section 3 we extend these quantitative estimates to finite length paths. In 
order to do this we need to discuss the development of a path into a suitable version 
of hyperbolic space - a technique that has more recently proved useful in [5] . Using 
this idea we obtain a quantitative estimate on the difference between the length 
of the developed path and its chord in terms of the modulus of continuity of the 
derivative of the path. This allows us to obtain, in the case where the derivative is 
continuous, some estimates on the coefficients in the signature and also shows how 
to recover the length of the path from the signature. 

We can also prove for example that any path with bounded local curvature and 
the first N terms in the signature zero must be rather long or trivial - a sort of 
rigidity theorem. We can obtain explicit bounds depending only on the curvature 
bounds and N. However they are far from sharp as we can see from the figure of 
8, a path with curvature at most An and length one. It is clear that the first two 
terms in its signature zero, but our results indicate that it cannot have all of the 
first 115 terms in the signature zero! 

After this we return to the proof of our uniqueness result, the extension of Chen's 
theorem. Our proof relies on various analytic tools (the Lebesgue differentiation 
theorem, the area theorem) , and particularly we introduce a mollification of paths 
that retain certain deeply non-linear properties of these paths to reduce the prob- 
lem to the case where 7 is piecewise linear. Piecewise linear paths are irreducible 
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piecewise regular paths in the sense of Chen and thus the result follows from Chen's 
Theorem. The quantitative estimates we obtained give an independent proof for 
this piecewise linear result. 

In Section |4] we establish the key properties for tree- like paths that we need. In 
Section 5 we prove that any path ^t|(g[o.T] ^ BV and with trivial signature can, 
after re-parameterisation, be uniformly approximated by (weakly) piecewise linear 
paths with trivial signature. This is an essentially non-linear result as the constraint 
of trivial signature corresponds to an infinite sequence of polynomial constraints of 
increasing complexity. In Section 6 we show that, by our quantitative version of 
Chen's theorem, such piecewise linear paths must be reducible and so tree-like in 
our language. 

This certainly gives us enough to show, in Section 7, that any weakly piecewise 
linear path with trivial signature is tree-like. It is clear from the definitions that 
uniform limits of tree- like paths with uniformly bounded length are themselves tree- 
like. Applying the results of section 5 the argument is complete. We draw together 
all the parts to give the proofs of our main Theorem and Corollaries in Section 8. 

2. Paths on the integer lattice 

2.1. A discrete case of Chen's theorem. Consider an alphabet A and new 
letters = {a~^, a € A}. Let il be the set of words in A U A~^. Then ft has 
a natural multiplication (concatenation) and an equivalence relation that respects 
this multiplication. 

Definition 2.1. A word w £ H. is said to cancel to the empty word if, by applying 
successive applications of the rule 

a . . . bcc~^d . . . e — > a . . . 5d . . . e, a, 6, c, d, e, . . . G A U A~'^ 

one can reduce w to the empty word. We will say that (a ... 5) is equivalent to 
(e.../) 

{a...b) ~ (e.../) 
if {a . . . bf^^ . . . e~^) cancels to the empty word. 

An easy induction argument shows that ~ is an equivalence relation. It is well 
known that the free group F^^ can be identified a,sfl/~. There is an obvious bijection 
between words in Jl, and lattice paths, that is to say the piecewise linear paths Xu 
which satisfy xq = and \\xk — Xk+i \\ = 1, are linear on each interval u G [k,k + l], 
and have Xk G Z'"^' for each k. The length of the path is an integer equal to the 
number of letters in the word. The equivalence relation between words can be re- 
articulated in the language of lattice paths: Consider two lattice paths x and y, 
and let z be the concatenation of x with y traversed backwards. Clearly, if x and y 
are equivalent then, keeping its endpoints fixed, z can be "retracted" step by step 
to a point while keeping the deformations inside what remains of the graph of z. 
The converse is also true: if U is the universal cover of the lattice, and we identify 
based path segments in the lattice with points in U then the words equivalent to 
the empty word correspond with paths Xt in the lattice that lift to loops in U . 
They are the paths that can be factored into the composition of a loop in a tree 
with a projection of that tree into the lattice. A loop in a tree is a tree-like path, 
as one can use the distance from the basepoint of the loop as a height function. 
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Chen's theorem tehs us that any path that is not retractable to a point in 
the sense of the previous paragraph has a non-trivial signature. Our quantitative 
approach allows us to prove an algebraic version of this result. Let 7^ be the lattice 
path associated to the word w — a^^ . . . a^^ (where a = (cti, . . . , ctl) G {±1}^ gives 
the signs associated to each letter). As the signature is a homomorphism, we have 

^ ilw) — ^ (lal^^ ■■■^{lal^^- Since j^^i is a path that moves ai units in a 
straight line in the direction, its signature is the exponential and >5'(7j„) = 
e'^i"! (g) . . . (g) e"^^"^. 

Our quantitative approach will show in Theorem [5] that for a word of length L 
in a two letter alphabet, if 

e'l'^i (g) . . . (g e"^''^ = (1, 0, 0, . . . , 0, x^W+i, . . .) , ae {±1}^ 

where N (L) = [e log (l + y/2) L\, then the re is an i for which Ui = — a^+i and by 
induction the reduced word is trivial. 

The proof is based on regarding Mf^ as the tangent space to a point in d- 
dimensional hyperbolic space H, scaling the path 7 and developing it into hyperbolic 
space. There are two ways to view this development of the path, one of which yields 
analytic information out of the iterated integrals, the other geometric information. 
Together they quickly give the result. We work in two dimensional hyperbolic space 
and, at the end, show that the general case can be reduced to this one. 

2.2. The universal cover as a subset of H. Let X be a lattice path in R^, 
9 > 0, and = OX be the re-scaled lattice path. The development of X^ into 
H moves along successive geodesic segments of length 6 in H, each time X^ turns 
a corner, so does and angles are preserved. 

For a fixed choice of 9 we can trace out in H the four geodesic segments from the 
origin, the three segments out from each of these, and the three from each of these, 
and so on. It is clear that if the scale 9 is large enough, the negative curvature forces 
the image to be tree. This will happen exactly when the path that starts by going 
along the real axis and then always turns anti-clockwise never hits its reflection in 
the line x = y. 

The successive moves can be expressed as iterations of a Mobius transform, 

—ir + X 
—i — rx 
m" (0) , 

and if r = l/\/2, then the trajectory eventually ends at (1 + i) /v^. Hyperbolic 
convexity ensures that all these trajectories are (after the first linear step) always in 
the region contained by the geodesic from (1 -I- i) /^/2 to (1 — i) In particular 
they never intersect the trajectories whose first move is from zero to i, to — i, or to 
— 1. Now, there is nothing special about zero in this discussion, and using conformal 
invariance it is easy to see that 

Lemma 2.2. If is at least equal to the hyperbolic distance from to 1/V2 in M, 
then the path Yf takes its values in a tree. This value 1/^/2 is sharp. 

We have developed X^ into a tree in H; we have already observed that a loop in 
a tree is tree- like. If we can prove that Y^ =^0 ^ the Y^ will be tree-like and hence 



m (x) := 
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SO will X^a,nd X. To achieve this we must use the assumption that the path has 
finite length and that all its iterated integrals are zero from a different perspective. 

2.3. Cartan development as a linear differential equation. If G is a closed 
subgroup of the matrices, and Xt\t<T is a path in its Lie algebra g, then the equation 
for the Cartan development Mt e G of Xt\t<T S is given by the differential 
equation 

Mt^st pa M^exp (SXt) or equivalently dMt = MfdXt. 
The development of a smooth path in the tangent space to in H 

M = {zgC\ \\z\\ < 1}, 

is also expressible as a differential equation. However, it is easier to express this 

development in terms of Cartan development in the group of isometrics regarded 
as matrices in GL(2, C) rather than on the points of H. We identify with the 
Lie subspace 

X + iy 
X — iy 

In this representation, the equation for is linear and so we have an expansion 
for M: 



Mt = Mo{I+ dXu+ dXu,dXu2 

\ Jo<u<T Jo<ui<U2<T 

KA- , a b 
Mo X 



where 



1 + ^2 I dXu-^dXu^ ■ ■ ■ dXu2k-ldXu2t: 

f. J0<Ul<U2< — <U2k<T 

b = / dXu-^dXu2 ■ ■ ■ dXu2k-i 

f. J0<Ul<U2<.-.<U2k-l<T 

and Jo^ui<---<u2k<T ■ ■ ■ dXu2k-idXu2k is now, with an abuse of notation, 

a complex number. We have an a priori bound: 



Lemma 2.3. If X is a path of length exactly 6L, then 

dXu^dXu2 ■ ■ ■ dXu2^_idXu2 



I 

Jo 



0<Ul<U2<.-.<U2k<T 



^ {2k)\ ■ 



To use this lemma we need to be able to estimate the tail of an exponential 
series. The following lemma (based on Stirling's formula) articulates a convenient 
inequality. 

Lemma 2.4. Let x>l/e. (1) ^ < ^^i/i holds for all m > ex. 

(2) If for any k one has m> ex + k, then 

y^< —Jl e-^a;-V2 ~ 0.38 e-'^x-'/^ 
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— ana is approacnea monoi 



Proof. By Stirling's formula limy_>oo - — |r — — "^^^ is approached monotonely 



from below. It is an upper bound and also a good global approximation to - — - 
valid for all j/ > 1. Putting y = ex gives 



< 



(ex)! 



< e 2 . 



(ex)! 



Moreover the recurrence relation for the ! function implies, for every fc e Z with 
ex + k > 0, that 



> e 



x" 



(ex)! (ex + fc)! 
and so 



^ex+k _ _^ I 

< e 2x 2 . 



(ex + ky. 

establishing the first claim. Now summing this bound we have 



< e 2 ^_ 2^ e 



-fc 



^ (ex + A;)! ~ " " V27r 
fc>o ^ ^ ^ fc>o 

62 1 1 

iX 2 . 



e - 1 x/27r 

Since for ex > the function fc ,^ is monotone decreasing on M+ we see 

(e2:+fc)! ^ 

that 

X - X™ 62 1 1 

> < X~2 

m\ - e - 1 ^ 



m>ex 



completing the proof of the lemma. Finally we note the approximate value of the 
constant: 

62 

0.38. 



V2^(6 - 1) 

□ 

2.4. The signature of a word of length L. We deduce the following totally 
algebraic corollary for paths X that have traversed at most L vertices. 

Theorem 2. // a path of length L in the two dimensional integer lattice (corre- 
sponding to a word with L letters drawn from a two letter alphabet and its inverse), 
has the first [elog(l + V^)L\ GL{2,C)-iterated integrals^ zero, then all iterated 
integrals (in the tensor algebra) are zero, the path is tree-like, and the corresponding 
reduced word is trivial. 



G-L(2, C)-iterated integrals: since our path is in a vector subspace of the algebra GL{2,C) 
we may compute the iterated integrals in the algebra GL{2,C) or in the tensor algebra over the 
vector subspace. There is a natural algebra homomorphism of the tensor algebra onto GL{2,C). 
The GL{2, C)-iterated integrals are the images of those in the tensor algebra under this projection 
and a priori contain less information. 
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Proof. Any Mobius transformation preserving the disk can be expressed as 

1 

/ r I I \ / — 

M 



z 
z 



r 1 



u 
Q 



where \z\ — \uj\ — \ and r is the EucUdean distance from to MO. Now 



Tt{A-B\ = ^5]a.A- 



» 1 



Tr BA 



[' 



and 



hence 



Tr 



uj 
cj 



MM 



UJ 

LU 



1 
1 



Tr 



\/l-r2 Vl-i 
r 1 



^ VI — ^"^ VI — r-^ 

2(l + r^) 



(l-r2) 



Letting r = 1/a/2 we see that if 



Tr 



MM 



<6, 



then the image of under the Mobius transformation must he in the circle of radius 
l/>/2- On the other hand 



Tr 





a 






b ' 




[( 


b 




it 


a 


)] 



{H' + \b\- 



and in our context, where the first N iterated integrals are zero, this gives the 
inequality 

2 

1 + / dXuidXu2 ■ ■ ■ d^U2k-ldXu2k 

f.-^jyJ0<Ui<-<U2k<T 



(2.1) 



E 

fc>JV 



.dX„ 



0<ui<-<U2k-i<T 



< 6. 



Using our a priori estimate from Lemma 12.31 we have that the inequality will hold 
if 



2fe-l 



(2fc- 1)! 



E 



2k 



(2fc)! 



< 6. 



\ k>N ' ' / \fe>JV 

Observing that, as we will choose N > 9L, the terms in the sums are decreasing, 
we have 



E 

k>N 



(2fc-i; 



< s 



and then 



E 

k>N 



m 

{2k)\ 



2k 



< s. 
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We see that p.ip will always be satisfied if we choose s such that 



Hence, if 



2(. + .»)<5a.di:p^<, 

k>N ^ ' 



^ (2/c- 1)! ^ 2 ^ 



then yT = 0. By Lemma [2:41 f 2) with x = log(l + \/2)L, we have if iV > elog(l + 

vMi±^^ < 0.38..(log(l + ^/2)L^V2 
to! 

m>Ar 

VTT- 1 

for aU L > 1. 

Observe that if 6* > log [l + \/2] then = MjO lies in a tree, and the develop- 
ment of Y is such that every vertex of the tree is at least a distance Q from the origin 
except the origin itself. By our hypotheses and the above argument (i[lT,0] < Q 
and hence It = 0. Therefore Y is tree-like and the reduced word is trivial. □ 



Finally we note that the case of the free group with two generators is enough to 
obtain a general result as the free group on d generators can be embedded in it. 

Lemma 2.5. Suppose that Td is the free group on d letters e; and that T is the free 
group on the letters a, b. Then we can identify /; g F so that the homomorphism 
induced by Ci fi from Yd to F is an isomorphism and so that the length of the 
reduced words fi are at most \ fi\ < 2 [logg |] + 3. 

Proof. It is enough to show that we can embed F23!-i into F so that each fi has 
length I. Consider the collection of all reduced words of length I in F. There are 
43'~^ of them if / > 0. Partition them into pairs, so that the left most letter of each 
of the words in a pair is the same up to inverses. Order them lexicographically. Now 
consider the space which is the ball in the Cayley graph of F comprising reduced 
words with length at most I. It is obvious that this is a contractable space. Now 
adjoin new edges connecting the ends of our pairs. Associate with each of the new 
edges the alternate letter and orient the edge to point from the lower to the higher 
word in the lexicographic order. Then this new space A is contractable to 23'""'^ 
loops and so has the free group F23!-i as its fundamental group. On the other 
hand, we can obviously lift any path in A to the Cayley graph of F; the map from 
loops in A to F is a homomorphism. The monodromy theorem tells us that this 
homomorphism induces a homomorphism of the homotopy group of A to F. As F is 
a tree, any two lifts of paths with the same endpoint in F are homotopic relative to 
those endpoints in graph F. Therefore we can associate every point in the homotopy 
group of A with a unique element of F and see that the homomorphism is injective. 
So we see that the image is a copy of the free group F23!-i. The generators of the 
classes in A clearly lift to paths of length 2Z -|- 1 in F and we take the end points of 
these paths to be the fi. □ 
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Theorem 3. // X is a path of length L in the d- dimensional integer lattice and 
the projections into GL{2,C) of the first [(2 [logg |] + 3)elog(l + V2)L\ iterated 
integrals are zero, then the path is tree-like. 

In this section, our arguments depend on the tree-hke nature of the development 
of the path in the lattice and little else - this is a property of the development into 
any rank one symmetric space but is still plausible, if less obvious for general ho- 
mogeneous spaces. Each space will give rise to a different class of iterated integrals 
that are sufficient to determine the tree- like nature of a path in a 'jungle gym'. 
One should note that computing the iterated integrals is not the most efficient 
way to determine if a word is reducible if the word, as opposed to its signature, is 
presented. 

3. Quantitative versions of Chen's Theorem 

We work in the hyperboloid model for H (which embeds the space H into a d-l- 1- 
dimensional Lorentz space) because the isometrics of H extend to linear maps. 
Consider the quadratic form on R'^+^ defined by 

d 

Id {x, y) = ^ Xiyi - Xd+iVd+i 
1 

and the surface 

m = {x, id{x,x) = -i}. 

Then H is hyperbolic space with the metric obtained by restricting Id to the tangent 
spaces to H. (If a; G H then {y\Id {y,x) — 0} is the tangent space to H in R'^+-'^ 
and moreover Id {z, z) is positive definite on z G {y\Id (y, x) =^ 0} and so this inner 
product is a Riemannian structure on H). In fact, (see p83) distances in H can 
be calculated using Id 

(3.1) - cos\\d(x,y) = Id{x,y) 

If SO {Id) denotes the group of matrices with positive determinant preserving 
the quadratic form Id then one can prove this is exactly the group of orientation 
preserving isometrics of H. The Lie algebra of SO {Id) is easily recognised as the 
d + 1 dimensional matrices that are antisymmetric in the top left d x d block and 
symmetric in the last column and bottom row and zero in the bottom right corner. 
Then the development of a path 7 G M'' to SO {Id) and H (chosen to commute 
with the action of multiplication on the right in SO {Id)) is given by solving the 
following differential equation 

/ • • • d^l\ 

(3.2) drt= '■ ■•■ : r*. 

••• d-ff 
V d^l ■■■ d-ff J 

We define X to be the development of the path 7 to the path in H starting at 



(0, • • • ,0, 1) and given by 



Xt = Tto. 



3precisely, M e SO (/„) if h {{My^f , (Mx')') = (y, x) 
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Now we can write dTt = F{d'y^)Tt where 







F -.x 



xi \ 





V xi 





Xd 



Xd 





is a map from 
norm on 



to Hom I 



and ]R'^+^ and the operator norm on Hom ( 
Lemma 3.1. In fact ||i^||ffo„,(R<i+i^K''+i) = 1- 
Proof. Let e G K'' and / e M. Then for x E R'^ 

fx 



) , where for precision we choose the EucUdean 



pd+i 



F{x) 



and computing norms 

fx 
e.x 

and hence \\F\\Hom(R' 



<f\\x\\' + \\e\\'\\x\ 



= 1. 



□ 



3.1. Paths close to a geodesic. We are interested in developing paths 7 of fixed 
length / into paths F in SO (Id) and in the function 

g{j) := d(o,ro) 

giving the length of the chord connecting the beginning and end of the development 
of 7 into Hyperbolic space. Amongst these paths 7 of fixed length, straight lines 
maximise g as the developments are geodesies. The function p is a smooth function 
on path space [6] . Therefore one would expect that for some constant K 

q{i)>1- Ke^ 

whenever 7 is in the e-neighbourhood (for the appropriate norm) of a straight line. 
We will make this precise using Taylor's theorem. 

Suppose our straight line is in the direction of a unit vector v. If our path 7 is 
parameterised at unit speed we can represent it by 

(i7j — Qtvdt, 

where Qt is a path in the isometrics of M"^. In this discussion we assume that Qt is 
continuous and has modulus of continuity S. Of course, 7 is close to i ^ to if O is 
uniformly close to the identity. Consider the development Tto — {xt,xt) of 7 into 
the hyperboloid model of H defined by 



Xt 
Xt 

dxt 
dxt 



xt-Qtvdt 
xtQtvdt 



We know that \\xt\ 



1 = l^tl and that 
coshd (Fto, o) = —Id 

= Xt 



Xq = 
Xq 



Xt 
Xt 
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in other words coshfp (7|[o,t]) ~ Xt- 

Proposition 3.2. Suppose the one can express Qt in the form e"^* where At is a 
continuously varying anti- symmetric matrix and that ||^||oo ^ V < 1- Then 

IcoshT-XTl < 4'^M!oo 

Proof. Suppose e G [—1,1]. We can introduce a family of paths 7f with jI = 7^ 
and with 7° the straight line tv by setting 

d-fl = e^'^'vdt 
7o = 0. 

We can then consider the real valued function / on [— 1, 1] comparing the length of 
the development of 7^ and the straight line 

/(e) := coshe(7^|tg[o,T]) -coshT. 

Of course / (0) = and / < 0. Now [6l Theorem 2.2] proves that development of 
a path 7 is Frechet differentiable as a map from paths to paths in all p- variation 
norms with p G [1,2). 
It is elementary that 

d{lt-ir") - e^^^{l^e'^^^)vdt 

= hAt e'^' vdt + H^tA^te'^^'vdt 



where ht G [0, /i]. Working towards the 1-variation derivative 



te[o,T] 



< 


1 




Jte['d,T] 



]^hlAle'^*vdt 



< '— I \Af\dt 



2 



te[o,T] 



and e — > 7*^ is differentiable with derivative 

Ate'^'^'vdt, 

providing /j^jq | | < cx) . A similar estimate shows that the derivative of 7^^)'^ 
exists and is 

^^{2)^6 ._ A^^e'^^'vdt, 

providing /jg^yj |^(| dt < 00. From [51 Theorem 2.2] we know that the develop- 
ment map is certainly twice differentiable in the 1-variation norm and applying the 
chain rule it follows that / is a twice differentiable function on [—1,1]. On the other 
hand / (0) = and / (e) < for e e [-1, 1] so that /' (0) = and applying Taylor's 
theorem 

0>/(l)> inf^/"(£). 
£e[o,i] 2 

In fact the derivatives in e form a simple system of differential equations. If 



Xt J \ J \ Ut J 2 y 2; 
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then 









■( 


V dzl ) 




( dxt \ 




dyt 




\ dzt j 





A'^e^'^'vdt 

e^^'vdt 

Ate/^^vdt 



A^e^^'vdt 




with the initial conditions 



ig = 4 = 1 

^0 = y'o^o 

z§ = z§ = 0. 

The simple exponential bound on the solution of a linear equation shows that 

\zl\ < 4™''''{'^'-/*e[o>T]l^tM*'/«e[o,T]|^?H*}- 
Applying Taylor's theorem we have that 

If ||A||^ < 1 then / (e) > —^4-^, and as ||A||^ < ^7 < 1, then we can replace A by 

r]~^A and evaluate at t] to deduce that Ja (1) > giving us the uniform 

estimate we seek. □ 

3.2. Some estimates from hyperbolic geometry. We require some simple hy- 
perbolic geometry. Fix A (in hyperbolic space), and consider two other points B 
and C. Let 9a, Ob, and Oc be the angles at A, B, and C respectively. Let a, b, and 
c be the hyperbolic lengths of the opposite sides. Recall the hyperbolic cosine rule 

sinh(6) sinh(c) cos(^a) = cosh(6) cosh(c) — cosh(a) 

and note the following simple lemmas: 

Lemma 3.3. // the distance c from A to B is at least log ^fz^^^fje^) ; then 

I^bI < \eA\. 

Proof. Fix c and the angle 9a, the angle 9b is zero if 6 = and monotone increasing 
as 6 ^ 00. Suppose that \9b\ > \9a\- We may reduce b so that \9b\ = 16*711, now 
the triangle has two equal edges and applying the cosine rule to compute the base 
length: 

sinh(a) sinh(c) cos(^a) = cosh(a) cosh(c) — cosh(a) 

(cos |6'a|) e^^^ + e^" - cos 1 61^1 + 1 



= log 
< lim log 

a— *r - 

= log 



e2" + (cos 16*^1) e"^" - cos \9a\ - 1. 
(cos \9a\) e^" + e^" - cos |6»a| + 1 
-e2« + (cos 161^1) e2« - cosIOa] - 1 

cosiei^l + 1 



1 — cos \9a\ 



Lemma 3.4. We have a > & + c — log r; — ^s-- and thus if max (b, c) > log — — 

— ^ 1 — COS^A V ' / — o 1 — cos 

then a > min (6, c) . 



□ 
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Proof. Suppose consider triangles with fixed angle 0a and with side lengths Xb, Ac 
and resulting length a (A) for the opposite. Then 

A6 + Ac - a (A) 

is monotone increasing in A with a finite limit. Now 

sinh(A&) sinh(Ac) cos(^a) = cosh(A6) cosh(Ac) — cosh(a (A)) 
cosh(A6) cosh(Ac) , . , cosh(a (A)) 

COS^t'A) — 



Thus 



sinh(A6) sinh(Ac) sinh(A6) sinh(Ac) 

}^ • utl^'^^ux ^ = (« (A) - A6 - \b) + log 2 

A^oo sinh(Ao) sinh(Ac) a^oo 

lin 
= log 

a > b + c — log 



A6 + Ac-a(A) < lim (A6 + Ac-a(A)) 

A— ^cx) 



1 — COS 6 a 
2 



1 — cos 6 a 

Also, providing max {b, c) > log jn^^gj, one has a > min (6, c). □ 

Corollary 3.5. // the distance c from A to B is at least log (^ i-cos\eA\ ) ' ^^^^ 

\0b\ < \9a\, 

and a>b. 

The above lemma is useful in the case where the angles of interest are acute. 
But in some contexts we are interested in one angle is very obtuse in which case 
the following lemma gives much better information. 

Lemma 3.6. Suppose that 9a > 7r/2 and that the distance c from A to B is at 
least log {V2 + l)then 6b < {n- 9a) /2. 

Proof. The second hyperbolic cosine rule states that 

sin(^s) sin(^A) cosh(c) = cos(^c') + cos(^b) cos(^a) 
, , , cos(^c) + cos(^b) cos(^a) 
= sin(^.)sin(^.) 

Fix 6a > tt/2. By our assumptions cosh (c) > -\/2. and so 

cos(6>c) + cos{9b) cos{9a) ^ ^ 
sin(6'B) sin(6'A) ~ 
Since the sum of interior angles in a Hyperbolic triangle is less than tt one can 
conclude that Ob = a {tt — 9a) where < a < 1 and that 6b and 6c are in 
[0, 7r/2). To prove this lemma we need to show further, that a < ^. It is enough to 
demonstrate that, in the case 6a > 7r/2, and | < a < l,we have 

cos(^C') + cos(^b) cos(^a) 



sin(^s) sin(^A) 

It is enough to prove that 

1 + cos{9b) cos{9a) 
sin(^s) sin(6lA) 



< V2. 
< V2. 
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Replacing {tt — Oa) by t and rewriting 

. , , 1 — cosfar) cos(t) 

/ (a, t) := — — , . , , 

sin(Q;r) sin(Tj 

it is enough to prove that f {a,T) < \f2 li t < tt /2 and ^ < a < 1. The derivative 
in a of / (a, t) is 

(cos (r) — cos (ar)) 

T — ; ^ 2 — 

sin (r) sin (olt) 

and so / is strictly decreasing in a in our domain. Hence, if a > 1/2 then 

f{a,T) < f Q,r 

The derivative of / (i, r) is readily computed as 

1 (1 + 2cos(T/2))tan(r/4) 
8 cos(r/4)^cos(T/2)^ 
and this is seen to be positive so that 



which completes the argument. □ 

Lemma 3.7. Let = Tq < . . . < Ti < . . .T^ = T be a partition of [0,T]. Let 

(^t)tg[or] continuous path, geodesic on the intervals [T^, T^-i-i] |i=o,...,n-i ™ 

hyperbolic space with n > 1 where, at each Ti, the angle between the two geodesic 
segments: ZXi^iXxiXx^^i is in [29, n]. Suppose that each geodesic segment has 

length at least K (0) log (jz:^^) • 

(1) d (Xq, XTi) is increasing in i and for each i <n 

(3.3) d{Xo,XTj > d {Xo, Xt,_,) + d {Xt,_„ Xt,) - K (9) 

> K{0) 



and the angle between XTi_i, X^i and XqXt^ is at most 9. 
(2) We also have 

n 

< ^d(XT,_i,XT,) -d{Xa,XTj < {n-l)K{e). 

i=l 

Proof. We proceed by induction. Suppose d{Xi^,XTi) > K (9) and the angle 
ZXTi_iXTiXTo is at most 9. Now the angle ZXTi_iXTiXTi^-^ is at least 29 so that 
the angle ZXToXxj^Ti+i is at least 9. As d {Xq, Xxi) > LC (9) and our supposition 
d [Xxi, Xxi^-^) > K{9), Lemma and CoroUarv 13.51 implv 

d (Xo , Xt,+ J > d {Xo ,XT,)+d{XT„XT,^,)-K{9), 

and that ZXoXt^^^Xt^ < 9 proving the main inequality. Using the induction one 
also has the second part of the inequality 

d{Xo,XT,^^)>K{9). 
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The second claim is obtained by iterating ()3.3p . 

i 

d{Xo,XT,^,) > d{Xo,XT,) + Y,d{XT,,XT,^,) - iK (6) . 

Now rearrange to get the result. □ 

3.3. The main quantitative estimate. Let 7 in R'' be a continuous path of 
finite length I, and parameterised at unit speed. With this parameterisation 7 
can be regarded as a path on the unit sphere in R''. We consider the case where 
w — > 7 (u) is continuous with modulus of continuity 6^. If a g R, then the path 
7^ t — + aj(t/a) is also parameterised at unit speed, its length is al and its 
derivative has modulus of continuity S^^ {ah) — (h). Its development from the 
identity matrix (defined in (|3.2p ) into SO (Id) is denoted by r^. 

The goal of this section is to provide a quantitative understanding for Ta as we 
let a — + CX3. Our estimates will only depend on S and the length of the path. We 
let Rq ^ log(l + V2) 

Proposition 3.8. Let 7 in be a continuous path of length I. For each C < 1 and 
1 < A/ G N then for any a chosen large enough that al > MRq and 5 ("37^"^) < 

2 (V2 - Vl + C2) 4-T«o one has 

U( V \ „ ^ /^ 4^^° , 161og2 ^ al^ fM+lRo^' 
\d (o, r„o) - a^l < + —6. 



2C 7r2 / i?o V M 



a 



and ifa = \,l> MRa, providing S (^^Ro) < y 2 (\/2 - VTTC^) 4" 
have for paths of any length 



X „ , 4-r«o l61og2\ I , /M + l 



/ M + l \ 

We set Di (C, M) = I ^-^^ + ) ll^: and D2 {M) = ^i?o so that the 
inequality becomes 

(3.4) \d (o, Tao) - la\ < DiS^ {D2/af al. 

We note that Rq w .881374, 4^" w 3.34393 < 4:(m+i)r„/m < ^2B.o ^ 11.5154^ 
^^^f ^ ~ 1.12369. Fixing M 1, one immediately sees that the distance d(o, Fqo) 
grows linearly with the scaling and the chordal distance d (o, Tao) behaves like the 
length of the path 7 as a ^ cx). We also note that the shape of this result is 
reminiscent of the elegant result of Fawcett 3, Lemma 68]: that among C^— curves 
7 with modulus of continuity d-y (h) < nh one has sharp estimates on the minimal 
value of d (o. To) given by 



cosh (aly/l — K^) ~ 
inf cosh {d (o. To)) = 



1 



A natural question to ask is whether our estimate (which is non-infinitesimal and 
only needs information about S-y (2i?o)) can be improved to this shape and even to 
this sharp form. 
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Proof. The path 7q, := ^ — » (V*^) of length al and parameterised at unit speed; 
its derivative has modulus of continuity Sa '■ t (t/a). Because al > MRq we 

can fix i? = al/N, where R G [Rq, ^^^^Ro] and is a positive integer depending 
on a. Let ti = iR where i G [0, A^]. Let Gi G SO (Id) be the development of 
the path segment Ja\lti-i,ti] into SO {Id) and Ta.t be the development of the path 



segment 7a|[o,t]- We define Xq :— o e M and Xj 
the points Ta.t o on the path Ta.to. 

As the length of the path is greater than any chord 

N 

\al — d{o,Tao)\ = al — d {Xj^i^X, 



GfX.^i e H. Then X,- are 



N 



(3.5) 
and 



-^d(X,_i,X,)-d(Xo,Xjv). 



AT 



i=l 

N 



al-Y,d X,) > ^ d , XO - d iXo,XN) > 0. 

i=l i=l 

We now estimate each of these terms from above. 

For the first term we use our result on paths close to a geodesic. By Proposi- 
tion [221 we have 

coshd{Xi^i,X,) > coshi?- ' 4^ 



2 

'R\2 



— cosh R — 



Thus, using the convexity of cosh and hyperbolic trig identities, 
S f-)^ 

-21^4^^ > coshi?-coshd(X,_i,X,) 

> (i?-d(X,„i,X,))sinhd(X,_i,X,) 
= (i? - d (X,_i, X,)) y^coshd (X,_i, X,)' 



> (i?-d(X,_i,X,)) 
Now, for C < 1, providing 



A ('iiA2 ^ 
CoshE- ^I^4« 



V2- 



B\2 



- 1. 



V2- 



we have 



R\2 



Mi) 

2C 



-4^ > (i?-d(X,_i,X,)) 



This will follow if we choose a large enough such that our condition 



Ro 



22 



B.M. HAMBLY AND TERRY J. LYONS 



holds. 

Hence, summing over all the pieces, we have 
We now use our bounds on R to obtain 



(3.6) 



RoJ 2C 



Applying Lemma \3l6\ and Lemma [3771 we see that as i? > i?o = log(l + \/2), then 
the angle XqXt^Xt„_i_i is at least tt — 26^ (^) for each < n < TV — 1. 
Thus 



E 



R J ^\^l-cos(^-2<5^(f)) 
Since ( -k- loe; ( -; , ) ) is increasine; in u for u < tt, we have 

Xw^ ^ \ 1 — cos|7r— u| J J ^ — ' 



log ; \ 7:^TRTT < 4<5^ - -^log 



l-cos|7r-2J.^(f)| / ~ ''V"/ \{TT/2f Vl-cos|7r/2| 



rV f 1 , / 2 



^ A ,'i?Vl61og2 . 
< o-y I — 1 — smce 



' a 



sJ-] < \I2(V2-VTTC^)4- 



which is less than 7r/4 for all C and M. 

5-y(i?) is increasing anu unau 



Observing that S~f{R) is increasing and that ^^Ro > R gives the second part 



of our estimate 

i3.7)±diX.^,,X.)-diXo,XrJ < ^g-l)^.(f)'- 
(3.8) < '-^(^-l)s. f^^ + '^o^' 



Ro J ' \ M a 
Combining the estimates p.6|) and (|3.8|) completes proof. □ 



3.4. Recovering the length of the path from its signature. From the last 
section we know that if a is large enough then 

(3.9) \d (o, Tao) - la\ < Did^ {D2/af al 

and in particular will go to zero as a — > oo if (5-,. (e) — o (e^/^). 

The lower bound on c?(o, Fqo) implicit in (|3.9p leads to a lower bound on the 
norm of Fq, as a matrix. We will compare it with the upper bound that comes 
from expressing the matrix F^ as a series whose coefficients are iterated integrals. 
We have an upper bound for each coefficient in the series, and taken together these 
provide a bound for the sum. This bound is so close to the lower bound that it 
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allows US to conclude a lower bound for each coefficient and relate the decay rate 
for the norms of the iterated integrals directly to the length of 7. 

It is an open question as to whether signatures with given decay rate correspond 
to paths of finite length. 

Proposition 3.9. Let G E SO (Id) ■ Then \\G\\ > e'*(°'^°) where \\G\\is the operator 
norm for G G Ham 1 

Proof. If 



/ 1 
1 



whe 



has the Euclidean 



\ 






V 





coshp 
sinh p 





sinh p 
cosh/9 / 



then FpFr = Fp^r and the set of such elements forms a (maximal) abelian subgroup 
of SO {Id}- Any element G of SO (Id) can be factored into a Cartan Decomposition 
KFpK where K and K are built out of rotations Q of R'^ 

e 

0* 1 



KFpK 



and p e M+. As an operator on Euclidean space, G has norm ||G|| = 

\\Fp\\ since K are isometries. In addition, the matrix Fp is symmetric and hence 
has a basis comprising eigenfunctions; its norm is at least as large as its largest 
eigenvalue. Computation shows that the eigenvalues of Fp are {e'', e"'', 1, • • • ,1} 
so that, given p > 0, one has 

IIGII >e''. 



On the other hand 



cosh d (o. Go) 



Id 



= — cosh p 



( 


{ ) 






\ 




Xd 











\ coshp / 




^ 1 ) 





and so |1G|| > e'^(°''5°). 



□ 



If 7 is a path of finite length then the development (|3.2p into hyperbolic space 
HI is defined by 

drt = F{dj,)Tu 

where by Lemma [3.11 -F : M'^ hom M'^+^) has norm one as a map from 

Euclidean space to the operators on Euclidean space. As a result the development 
of a path X is given by 



G = I+ F (dXu) 

JO<u<T 



F{dXuJ' 



0<ui<...<Uk<T 
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and, as in Lemma [231 if ^ is a path of length 6, then we have an a priori bound 



/ 



i<...<«fc<T 

Applying this to aj, we conclude that 



< 



JO<ui<...<uj.<l 
Jo<ui<...<Uk<l 



where I is the length of 7. Letting 

b„ = n! 
one has for all a that 

00 ^. 



0<Jii<...<«fc<l 



fc=0 

fc 



< bk<l 

Thus the expectation of 6„ with respect to a Poisson measure with mean al is close 
to one while at the same time the 5„ are all bounded above by one and positive. In 
particular 



00 i. 
a 



fe=0 



and so 



\l -bk\ < inf A:!c 

' ' a>l 



I _ p-Di5{D2/afal 



applying Stirling's formulae that kl — e'^^°^^ k+l\ogk+Ck -^vhere Cu — o(l) and 
setting a — k/l gives 

< l^C5{lD2/kf k\fk 

where C = Die^^ and so we see that, if 5^ {lD2/kf' k^^"^ — > as /c 00, then 
bk/l^ 1- Thus we have shown the following 

Theorem 4. For any path of finite length with 5^ (e) = o (e^/''), 



r'^ki 



0<ui<...<Uk<l 



1, 



as k ^ 00, 



This is of course quite a strong result obtained by making strong assumptions. 
One could ask less and so we give a weaker but more widely applicable result. 
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Theorem 5. Let j be a path of finite length I, and suppose its derivative, when 
parameterised at unit speed, is continuous. Then the Poisson averages C'a of the bk 
defined by 

oo ^ 
fc=0 

satisfy 

lim — log Ca — I — I 

Note that the Cq are averages of the bk against Poisson measures; it is standard 
that these are close to Gaussian with mean a and variance a. 



Proof. Note that 



and so 



oo 1^ 

^ fc! 

k=0 



(3.10) ^^i^ - ; < 1 log ( e-" ± ^bA +1-1 <0 

\ k=o J 



and using p.9|) we have 

d{p,Vao) 



a 

and so the left hand side in p.lOp goes to zero. □ 

In particular we see that the high order coefficients of the signature already 
determine the length of the path and in fact one can obtain quantitative estimates 
in terms of the modulus of continuity for the derivative of 7. 

Conjecture 3.10. The length of^y can be recovered from the asymptotic behaviour 
of averages of the bk . 

It might be that linia^oo 1 + ^ logCc gives the length of 7 directly although the 
Poisson averages may have to be replaced in some way. 

We conclude with an analogous result to that proved for the lattice case in 
Proposition [21 

Theorem 6. Let ^ be a path of length I parameterised at unit speed, and let 5^ 
be the modulus of continuity for 7. Fix C < 1 and 1 < A/ G N. Suppose that 
5^1 (0) < --j====, then there is an integer N (1,6) such that at least one of the 

first N(l, 6) terms in the signature must be non-zero. 

Proof. In the case where the first eal coefficients in the signature of the path 7 are 
zero, by Lemma 12.41 we have some explicit constant Ci such that 

lir.ll < 1+ E ^ 

^ — ^ ml 

7n>ea.l 

< l + Ci{al)-^^^. 
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By Proposition [3?9l and letting a be sufficiently large so that we can apply (13. 9p . 

||r„|| > e'^(°'^°°) 

^la-DiS^(D2/afal 

> l + la- Di5^{D2/af al. 

These two statements lead to a contradiction if for large a, we have la— Di^-y {02/0)^ al > 
Ci{al)-^l^ or 

Thus providing 1 > Di5^ (D^jcif' for some large a (continuity of the derivative is 
enough) then the left hand side goes to infinity as a 00. This always gives a 
contradiction and shows that existence of A^(Z, S). □ 

An explicit estimate is A^(/, S) — [eaZ] , where one chooses the smallest a > M3o.^ 
large enough so that 



and so that a^^"^ {l — Di5^ (D2/a)^^ > Cil~'^^^. To give an idea of the numerical 
size of N, the number of iterated integrals required for this result, we note that if 
the path has S{h) < h, then the optimal value of a is around 15.2 a± C — 0.8875 
(with M = 1) and the number N is the integer greater than 41.38Z (for large I). 
With a more careful optimization of the constants (varying M) our estimate can 
be reduced to the integer greater than 13.28Z (for large I). 

Remark 3.11. We note that this proof did not require that 5^ (0) = or that 7 is 
continuous. 

Remark 3.12. An easy way to produce a path with each of the first N iterated 
integrals zero is take two paths with the same signature up to the level of the iV'th 
iterated integral and to take the first path concatenated with the second with time 
run backwards. Since these paths will, except at the point of joining, have the same 
smoothness as they did before, all focus goes to the point where they join. One 
could hope that a development of these ideas would prove that the two paths must 
be nearly tangential. If this were exactly true, then it would give a reconstruction 
theorem. 

We have obtained quantitative lower bounds on the signature of 7 when 7 is 
parameterised at unit speed and 7 is close to continuously differentiable. In fact 
one could obtain estimates whenever the 7 is piecewise continuous and the jumps 
are less than tt. However the main extra idea is already visible in the case where 7 
is piecewise constant. We give an explicit estimate in Theorem |9] in Section [6l 



4. Tree-Like paths 

We now turn to our proof of the extension of Chen's theorem to the case of 
finite length paths. In this section we suppose that X^gjo^T] is a path in a Banach 
or metric space E and we recall our definition 11.31 of tree-like paths in this more 
general setting. 



UNIQUENESS FOR THE SIGNATURE 



27 



Theorem 7. If X is a tree-like path with height function h and, if X is of hounded 
variation, then there exists a new height function h having bounded variation and 
hence X is a Lipschitz tree like path; moreover, the variation of h is bounded by 

the variation of X . 

Proof. The function h allows one to introduce a partial order and tree structure 
on [0,T]. Let t G [0,T]. Define the continuous and monotone function gt{.) by 

gt (v) = inf h{u), v e [0, t] . 

v<u<t 

The intermediate value theorem ensures that gt maps [0, t] onto [0,h{t)]. Let Tj be 
a maximal inverse of h in that 

(4.1) Tt {x) = sup {u e [0, t] \gt iu)=x}, x€ [0, hit)]. 
As gt is monotone and continuous 

(4.2) Tt (x) = inf {u e [0, t] \gt (u) > x} 
ioY X < h (t). 

Now say s ^ t if and only if s is in the range of Tt ; that is to say if there is an 
X € [0,h (t)] so that s = Tt (x). Since Tt [h [t)) = sup{M S [0, t] \gt [u) — h (t)}, it 
follows that Tf (h (t)) = t and so t < t. Since h [Tt [x)) = x for x € [0, /i (t)] we 
see there is an inequality-preserving bijection between the {s\s ^ t} and [0, h {t)]. 

Suppose ti < to and that they arc distinct; then h (ti) < h (to). We may choose 
xi G [0, h (to)) so that ti — Tt„ (xi), it follows that 

h = Tto [xi) 

= inf {m e [0,io] Iffto (■") > , 

and that 

h (ti) = xi <h{u) , uG {ti,to\ ■ 

Of course 

9to (*i) = , >f h{u) 

ti <:U<tQ 

= h{ti) 
= 9ti {ti) , 

and hence gtg (u) = gt^ {u) for all u G [0,ti]. Hence, Ttg (x) = rti (x) for any 
X < gi^ (ti) = h (ti) = xi] wc have already seen that Tj^ [h [ti)) = ti = Tt„ (xi). It 
follows that the range Tt^ ([0, h (ti)]) is contained in the range of rtg. In particular, 
we deduce that if t2 ^ ti and ti ^ to then ^2^*0- 

Wc have shown that ^ is a partial order, and that {t\t ^ to} is totally ordered 
under ^, and in one to one correspondence with [0, h (to)]- 

Now, consider two generic times s < t. Let xq = infs<„<t h (u) and I = 
{v € [s, t] \h (v) = xq}. Since h is continuous and [s, t] is compact the set / is non- 
empty and compact. By the construction of the function gt it is obvious that gt < <?« 
on [0, s] and that if gt (w) = gs (u), then gt (v) = gs (v) for v G [0,u]. Thus, there 
will be a unique r G [0, s] so that g^ = gt on [0, r] and gt < gs on (r, s]. Observe that 
gt (r) = Xo and that Tt (xq) = sup/ and, essentially as above Tg = Tt on [0, h (r)). 
Observe also that if ? e [s, t] then gg = gi on [0, r] so that Tg = on [0, h (r)). 
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Having understood h and r to the necessary level of detail, we return to the path 
X. For X, y G [0, h {t)] one has, for x < y, 

||^r,(x) - X,,iy)\\ < h (rt {x)) + h {Tt {y)) - 2 inf h (u) 

«6[t(x),t(2/)] 

< x + y-2 inf h{Tt{z)) 

= y-x 

so we see that X^-^q is continuous and of bounded variation. 

The intuition is that -'frt(.) is the branch of a tree corresponding to the time 
t. Consider two generic times s < t, then X^^(^ -^ and -^Tt(.) agree on the ini- 
tial segment [0, /i(r)) but thereafter Tg (.) e [r,s] while Tf (.) e [sup/, f]. The 
restriction of -'^rt(.) to the initial segment [0,h{r)) is the path -^T3up/(.)- -^^ 
/i (r) = inf [h {u\u € [s,t])] they have independent trajectories after h (r). 

Let h (t) be the total 1-variation of the path X^^i^^y The claim is that h has total 
1-variation bounded by that of X and is also a height function for X . 

As the paths X^^(^/^ and ^rt(.) share the common segment X^-^^ -^ we have 

\\Xs-X,\\ = \\X,^^,)-X,^^,^\\<hit)-h{r)+h{s)-h{r), 
and in particular 

\\Xs-Xt\\ <h{s)+h{t)-2h{r). 
On the other hand h{r) =h (sup /) = infs<„<t (ji (u)^ and so 

||X is) -X{t)\\<h is) + h{t)-2 inf (h {u)) . 

s<u<t \ / 

and h is a height function for X. 

Finally we control the total variation of h by cux , the total variation of the path. 
In fact, 

h{s)-hit) < h{s) + h{t)-2 inf (h{u)] 

s<u<t V / 

where uJx{s,t) = sup^jgp^^ HXf^^^ — Xt^\\ |, with D denoting the set of all par- 
titions of [s,t] and for D £ V, then D = {s < ■ ■ ■ < U < ti+i < ■ ■ ■ < t}. The 
first of these inequalities is trivial, but the second needs explanation. As before, 
notice that the paths A"t-^( ), and A't-jJ ) share the common segment A^rr(.) and that 

infs<„<t {h (m)^ — h{r). So h (s) + h{t) — 2 infs<„<t ^ft, (m)^ is the total length of 

the two segments -'^rs(.)|[/i(r),/i(s)] and XT^(^^)\[h{r),h{t)]- Now the total variation of 
Xrt(.)\[hir)Mt)] is obviously bounded by (sup/,i), as the path X.rt(.)|[h(r),h(t)] is 
a time change of X \ [g^p / . 

It is enough to show that the total length of -'^Ts(.)|[/i(r),/i(s)] is controlled by 
wx (s,inf 7) to conclude that 

h{s) + h (t) - 2 inf (h («)) < ujx (s, t) . 

s<u<t \ J 

In order to do this we work backwards in time. Let 
U{u) = inf h{v) 

s<v<u 

Pt{x) = M{uG[s,T]\f,{u)=x} 
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then, because X is tree-like 

^r,(.)l[0,ft(.)] = ^p,(.)l[0,/i(s)], 

and in particular, the path segment ^p^(. ) |[/i(r), /i(s)] is a time change (but back- 
wards) of X|[^^i„fj]. □ 

The property of being tree-like is re-parameterisation invariant. We see infor- 
mally that a tree-like path X is the composition of a contraction on the i?-tree 
defined by h and the based loop in this tree obtained by taking t € [0, T] to its 
equivalence class under the metric induced by h (for definitions and a proof see the 
Appendix) . 

Any path that can be factored through a based loop of finite length in an R 
-tree and a contraction of that tree to the space E is & Lipschitz tree-like path. 
If is the root of the tree and (/) is the based loop defined on [0,T], then define 
h{t) = d (0, (p (t)). This makes 4> a tree-like path. Any Lipschitz image of a tree-like 
path is obviously a Lipschitz tree-like path. 

We have the following trivial lemma. 

Lemma 4.1. A Lipschitz tree-like path X always has bounded variation less than 
that of any height function h for X . 

Proof. Let V = {to < ... < i„} be a partition of [0,T]. Choose Ui £ [ti^i,ti] 
maximising h (ti) + h (ti-i) — 2h (ui) and let V — {to < ui . . . < tn-i < u„ < i„} . 
Relabel the points of I? = {vo < f i . . . < Then 

(4.3) ^||Xt.-Xt^_J <^|/i(t;.)-/»(x'.-i)|. 

□ 

We now prove a compactness result. 

Lemma 4.2. Suppose that {/i„} are a sequence of height functions on [0,T] for a 
sequence of tree-like paths {Xn}. Suppose further that the hn are parameterised at 
speeds of at most one and that the Xn take their values in a common compact set 
within E. Then we may find a subsequence {Xn(k), hn(k)) converging uniformly to 
a Lipschitz tree-like path {Y, h). The speed of traversing h is at most one. 

Proof. The hn are equi-continuous, and in view of (|4.3p the Xn are as well. Our 
hypotheses are sufficient for us to apply the Arzcla-Ascoli theorem to obtain a 
subsequence {Xn(k)j ^n(fc)) converging uniformly to some {Y, h). In view of the fact 
that the Lip norm is lower semi-continuous in the uniform topology, we see that 
/i is a bounded variation function parameterised at speed at most one and that Y 
is of bounded variation; of course h takes the value at both ends of the interval 
[0,T]. 

Now hn{k) converge uniformly to h and hence inf„g[s hn(k) i^) ^ iiif«e[s,t] h (u) ! 
meanwhile the hn are height functions for the tree-like paths Xn and hence we can 
take limits through the definition to show that /i is a height function for Y . 

□ 

Corollary 4.3. Every Lipschitz tree-like path X has a height function h of minimal 
total variation and its total variation measure is houndedly absolutely continuous 
with respect to the total variation measure of any other height function. 
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Proof. We see that this is an immediate coroUary of Proposition [7] and Lemma ETT] 
and [421 □ 



There can be more than one minimiser h for a given X. 

5. Approximation of the path 

5.1. Representing the path as a line integral against a rank one form. Let 

7 be a path of finite variation in a finite dimensional Euclidean space V with total 
length T and parameterised at unit speed. Its parameter set is [0, T]. We note that 
the signature of 7 is unaffected by this choice of parameterisation. 

Definition 5.1. Let 7([0,T]) denote the range of 7 in and let the occupation 
measure fi on {V, B{V)) be denoted 

M(A) = |{s<r|7(,s)eA}|, Acv. 

Let n (x) be the number of points on [0, T] corresponding under 7 to a; G -B. By 
the area formulae [H] pl25-126, one has the total variation, or length, of the path 
7 is given by 

(5.1) Far (7) = y n (x) Ai (dx) , 

where Ai is one dimensional Hausdorff measure. Moreover, for any continuous 
function / 



f{j{t))dt = J fix)n{x)A,idx). 
Note that fi = n (x) Ai and that n is integrable. 

Lemma 5.2. The image under ^ of a Lebesgue null set is null for fi. That is to 
say /i(7W) = iN)\=0 if \N\ = 0. 

Definition 5.3. We wiU say that N C [0, T] is 7-stable if 7-^7 {N) = N. 

As a result of Lemma 15.21 we see that any null set can always be enlarged to a 
7-stable null set. 

The Lebesgue differentiation theorem tells us that 7 is differentiable at almost 
every u in the classical sense, and with this parameterisation the derivative will be 
absolutely continuous and of modulus one. 

Corollary 5.4. There is a set G of full /i measure in V so that 7 is differentiable 
with I7' (<)| = 1 whenever {t) G G. We set M = 7~^G. M is ^-stable. 

Now it may well happen that the path visits the same point a; G G more 
than once. A priori, there is no reason why the direct ions of the derivative on 
{t G A/ 1 7 [t) — to} should not vary. However this can only occur at a countable 
number of points. 

Lemma 5.5. The set of pairs {s,t) of distinct times in M x M for which 

7(s) = j{t) 
7'(s) ^ ±7'(i) 

is countable. 
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Proof. If 7(s«) = 7(t*) but 7'(s*) ^ ±7'(<») then, by a routine transversality 
argument, there is an open neighbourhood of {s*,U) in which there are no solutions 
of 7(s) = j{t) except s = s^,t = t^. □ 

Up to sign and with countably many exceptions, the derivative of 7 does not 

depend on the occasion of the visit to a point, only the location. Sometimes we 
will only be concerned with the unsigned or projective direction of 7 and identify 
V G S with —V. 

Definition 5.6. For clarity we introduce ~± as the equivalence relation that iden- 
tifies V and —V and let [7']-^. € S/~± denote the unsigned direction of 7. 

7'^ is defined on the full measure subset of [0, T] where 7' is defined and in S. 

CoroUciry 5.7. There is a function cj) defined on G with values in the projective 
sphere S/~± so that (j) (7 (i)) = [7' (i)]-^ . 

As a result we may define a useful vector valued 1-form /u-almost everywhere 

on G. If ^ is a vector in 5, then (^, u) ^ is the linear projection of u onto the 
subspace spanned by ^. As {^,u)S, = {—^,u) {—£,) it defines a function from S/~± 
to Hom{V,V). 

Definition 5.8. We define the tangential projection 1-form w. Let ^ be a unit 
strength vector field on G with [^]-^ = (j). Then 

u>{g,u) = {^{g),u)^{g), ^g^G^u 

defines a vector 1-form. The 1-form depends on 0, but is otherwise independent of 
the choice of ^. 

The 1-form w is the projection of u onto the line determined by (j) [g) . 
Theorem 8. 

Proposition 5.9. The tangential projection lo, defined ji a.e. on G, is a linear 
map from V ^ V with rank one. For almost every t one has 

7'(t)=c.(7(t),7'(t)) 
and as a result, using the fundamental theorem of calculus for Lipschitz functions, 

l{t) = f rf7„ + 7(0) 

Jo<u<t 

= / w o -f 7 (0) , 

Jo<u<t 

for every t <T. 

By approximating co by other rank one 1-forms we will be able to approximate 7 

by (weakly) piecewise linear paths that also have trivial signature. It will be easy 
to see that such paths are tree-like. The set of tree-like paths is closed. This will 
complete the argument. 
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5.2. Iterated integrals of iterated integrals. We now prove that if 7 has a 
trivial signature (1,0,0, . . .), then it can always be approximated arbitrarily well 
by weakly piecewise linear paths with shorter length and trivial signature. Our ap- 
proximations will all be line integrals of 1-forms against our basic path 7. Two key 
points we will need are that the integrals are continuous against varying the 1-form, 
and that a line integral of a path with trivial signature also has trivial signature. 
The Stone- Weierstrass theorem will allow us to reduce this second problem to one 
concerning line integrals against polynomial 1-forms, and in turn this will reduce 
to the study of certain iterated integrals. The application of the Stone- Weierstrass 
theorem requires a commutative algebra structure and this is provided by the coor- 
dinate iterated integrals and the shuffle product. For completeness we set this out 
below. 

Suppose that we define 

0<u\<...<Uj.<u 

and 

Zu:= I ■ J d7u,---rf7„. 

0<ui<...<Ui^<u 

then it is interesting as a general point, and necessary here, to consider iterated 
integrals of Z and Z 

It will be technically important to us to observe that such integrals can also be 
expressed as linear combinations of iterated integrals of 7 so we do this with some 
care. Some of the results stated below follow from the well known shuffle product 
and its relationship with multiplication of coordinate iterated integrals. 

Definition 5.10. The truncated or n-signature Xg"''= (l, j, j, . . . , X"^) is 
the projection of the signature X^^j to the algebra T*^"^ (V) := 0"^q V^^ of tensors 
with degree at most n. 

Definition 5.11. If e is an element of the dual space V* to V, then 7^ — (e,7„) 
is a scalar path and ^7^ — (e, dj^). If e = (ei, . . . , e^) is a list of elements of the 
dual space to V , then we define the coordinate iterated integral 

Kt--^ J ■ J dji\...djZ = {^,Kt)- 

S<Ui <...<Uj.<t 

Lemma 5.12. The map e Xf^ defined above extends uniquely as a linear map 
from T^"' {V*) to the space of real valued functions on paths of bounded variation. 

Proof Let e e T^"' {V*). Since T^") (V*) is dual to T^") {V) the pairing e ^ 
e, X^"-*^ defines a real number of each path 7. If e = (ei, . . . , e^) then this coincides 
with f; since such vectors span T(") {V*) the result is immediate. □ 



We therefore extend definition 15.111 
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Definition 5.13. For any n, e e T^") {V*) we call the e-coordinate iterated 
integral of 7 over the interval [s,t]. 

These functions on path space are important because they form an algebra under 

pointwise multiplication and because they are like polynomials and so it is easy to 
define a differentiation operator on this space. Given two tensors e, f there is a 
natural product e U f , called the shuffle product, derived from the above. For basic 
tensors 

e = ei (g) . . . (g) G V^"" 

f = h^.-.^fseV^' 

and a shuffle (tti, 712) (a pair of increasing injective functions from (1, .., r), (1, .., s) 
to (1, .., r + s) with disjoint range) one can define a tensor of degree r + s: 

'^(Ti.Trj) =Wl0...0Wr+s, 

where = ej ior j = 1, . . .r and WttjO) — fj for i = 1, . . . s. Since the ranges 

of TTi and 7r2 arc disjoint a counting argument shows that the union of the ranges 
is 1, . . . , r + s, and that ojk is well defined for all fc in 1, . . . r + s and hence ijJ{Tri,w2) 
is defined. By summing over all shuffles 

eUf= 

(7ri,7r2) 

one defines a multilinear map of V'»''xV^'^V^<^''+^\ 

Definition 5.14. The unique extension of U to a map from T {V) xT {V) —^ T {V) 

is called the shuffle product. 

The following is standard. 

Lemma 5.15. The class of coordinate iterated integrals is closed under pointwise 
multiplication. For each 7 the point-wise product of the e-coordinate iterated integral 
and the f -coordinate iterated integral is the {e Li f)-coordinate iterated integral: 

v-e yf yeUf 

Corollary 5.16. Any polynomial in coordinate iterated integrals is a coordinate 
iterated integral. 



Remark 5.17. It is at first sight surprising that any polynomial in the linear func- 
tionals on T{V) coincides with a unique linear fmictional on T (V) when restricted 
to signatures of paths and reflects the fact that the signature of a path is far from 
being a generic clement of the tensor algebra. 

A slightly more demanding remark relates to iterated integrals of coordinate 
iterated integrals. 

Proposition 5.18. The iterated integral 

(5-2) J ■ J dXt^^^...dX!;^^ 

S<Ui<...<Ur<t 

is itself a coordinate iterated integral. 
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Proof. A simple induction ensures that it suffices to consider the case 



S<Ui <Ur<t 



where 



and in this case 



e = ei(g) . . .(g) Br e (V*f 



J J dxi^^dxl^^ = J... J djii... d^xd^i\ . . . d^fs. . 

s<m<U2<t S<Vi< ...<Vr<t 

S < Wi < . . . < Ws < t 
Vr < Ws 

Expressing the integral as a sum of integrals over the regions where the relative 
ordering of the Vi and Wj arc preserved (i.e. all shuffles for which the last card 
comes from the right hand pack) we have 



// 

S<Ul<U2<t 



(eUf)®/^ 
s,t 



□ 



From this it is, of course, clear that 



Lemma 5.19. If a path has trivial signature, then all iterated integrals of its iterated 
integrals are zero. 

5.3. Bounded, measurable, and integrable forms. Recall that 7 is a path 
of finite length in V, and that it is parameterized at unit speed. The occupation 
measure is n and has total mass equal to the length T of the path 7. Let {W, ||||) 
be a normed space with a countable base (usually V itself). If a; is a /i-integrable 
1-form with values in W then we write 



Ml^{V,B{V)) = \W {y)\\Hom(V,W) M {dv) = \\^ {lt)\\Hom(V,W) 



dt. 



Proposition 5.20. Let u G {V,B{V) be a /i-integrable 1-form with values 



in W . Then the indefinite line integral yt :~ /q uj (c^7t) is well defined, linear in u), 
and a path in W with 1-variation at most \\<^\\Li(vB{v) n)- 

Proof. Since w is a 1-form defined /i-almost surely, lo (7^) € Ham {V, W) (where 
Horn {V,W) is equipped with the operator norm ||-||) is defined dt almost every- 
where. Since w is integrable, it is measurable, and hence u) (7^) is measurable on 
[0,T]. Since 7 has finite variation and is parameterized at unit speed, it is dif- 
ferentiable almost everywhere and its derivative is measurable with unit length dt 
almost surely. Hence ^(7^) (7j) is measurable and dominated by ||w(7t)||) which 
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is an integrable function, and hence u> (74) (7^) is integrable. Thus the line integral 
can be defined to be 

yt = uj{jj{jjdu 
Jo 

\\yt-ys\\ < / \\^^{lu)\\HomiVM)\\iu\\du 
J s 

t 

\\^{l^i)\\Hom(V,W)'^^ 

and so has 1-variation bounded by ||t<^|lii(y6(\/))- D 

Proposition 5.21. Let ujn S {V,B(y) , fj.) be a uniformly bounded sequence of 
integrable l-forms with values in a vector space W. Suppose that they converge in 
(V, B (V) , n) to (jj, then the signatures of the line integrals J ojn (c?7f) converge 
to the signature of jLu{djf). 

Proof. The r'th term in the iterated integral of the line integral / aj„ {djf) can be 
expressed as 

(7ui) «)•■•«) w« (7«J (7ui) ■ • ■ (7uJ dm... dur 

Q<ui<...<Ur<T 

and since w„ converge in {V,B{V) it follows that from the definition of /j, 
that ujn ilu) converge to u) (7^^) in ([0, T] , B (R) , du) almost everywhere. Thus 
{lui)®- ■ -^^n (lu,.) converges in ([0,7"]' ,B(R) ,dui . . .dur). Since ||7„|| = 1 
for almost every u, Fubini's theorem implies that 

(7„i) ®-.-®UJn (7„J (7ni) ■ • ■ (7nJ 

converges in ([0, T]^ , B (M) , dui . . . du^.) to 

(7„J ®...(^u) (7„J (7„J . . . (7„J . 
Thus, integrating over 0<ui < ... < Ur < T, the proposition follows. □ 

Corollary 5.22. Let u G L^ {V,B{V) If j has trivial signature, then so does 
j Lo (d^y^). That is to say, for each r, 

J ■ J c.(d7„J...c.(d7„J=0eTy«'-. 

0<«i,...,«r<T 

Proof. It is a consequence of Proposition 15 . 2 II that the set of L^ {V, B {V) , y) forms 
producing line integrals having trivial signature is closed. By Lusin's theorem, one 
may approximate, in the L^ (V, B (V) , ^) norm, any integrable form by bounded 
continuous forms. If the initial form is uniformly bounded then the approximations 
can be chosen to satisfy the same uniform bound. 

The support of jj, is compact, so by the Stone Weierstrass theorem, we can 
uniformly approximate these continuous forms by polynomial forms uj — "^^PiCi, 
where the pi are polynomials and are a basis for V* . Using the fact that 

^= /•■•/ dj:\...djz, 

0<Ui,...,Ur<T 
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with Corollary 15.161 and Proposition 15.181 we have that the line integrals against 
these polynomial forms and their iterated integrals can be expressed as linear 
combinations of coordinate iterated integrals. If 7 has trivial signature, then by 
Lemma [5 . 1 91 these integrals will all be zero. It follows from the {V, B {V) , fi) con- 
tinuity of the truncated signature, that the signature of the path formed by taking 
the line integral against any form oj in (V, B {V) , /i) will always be trivial. □ 

5.4. Approximating rank one 1-forms. 

Definition 5.23. A vector valued 1-form oj is (at each point of V) a linear map 
between vector spaces. We say the 1-form uj is of rank A; e N on the support of /i 
if dim (w (V)) < fc at /i almost every point in V. 

A linear multiple of a form has the same rank as the original form, but in general 
the sum of two forms has any rank less than or equal to the sum of the ranks of the 
individual components. However, we will now explain how one can approximate 
any rank one 1-form by piccewise constant rank one 1-forms lo. Additionally we will 
choose the approximations so that, for some e > 0, if w (x) ^ lo (y) and |x — t/| < e, 
then either lu (x) or uj (y) is zero. 

In other words lu is rank one and constant on patches which are separated by 
thin barrier regions on which it is zero. The patches can be chosen to be compact 
and so that the /i-measure of the compliment is arbitrarily small. 

We will use the following easy consequence of Lusin's theorem for one forms 
defined on a /^-measurable set K. : 

Lemma 5.24. Let lo be a measurable 1-form lo in (V, B {V) , /i). For each e > 
there is a compact subset L of "f [0, T] so that lo restricted to L is continuous, while 



Lemma 5.25. If lo is a measurable 1-form in {V, B (V) , fi), then for each £ > 
there are finitely many disjoint compact subsets Ki of K and a 1-form lu, that is 
zero offUiKi and constant on each Ki, such that 



and with the property that lo is rank one if lu is. 

Proof. Let L be the compact subset introduced in Lemma 15.241 Now lo [L) is 
compact. Fix £ > and choose Zi, . . . , /„ so that 





and put 




Now choose a compact set Kj C Fj \ Fj^i so that 



fi{{F,\F,^,)\K,)< 



M 



L°°iL,B(L),ii) 
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Then the Kj are disjoint and of diameter -^j^- Moreover 



and 

lkll//om(y,w')A*(c«2;) < £ 



j 

J L\ 



For each non-empty Kj choose kj E Kj. Define uj as follows. 

w (fc) = Lo (kj) , fee Kj 

= 0, keKX^l^^Kj. 



Then 



using Lemma 15.241 one has 



/ \\^-^\\Horn(V.W)f^idx) < £ 

2e 



ll^-'^llHom(y,M/)M(c^a;) <e 

K\L 



and finally 



\UJ — OJl 



Honi 



{v,w) M {dx) < 4e. 



If Lu had rank 1 at almost every point of K, then it will have rank 1 everywhere 
on L since lu is continuous. As either a) (fc) = (fcj) for some kj in L or is zero, the 
form oj has rank one also. □ 

Proposition 5.26. Consider the set V of one forms on a set K for which there 
exists finitely many disjoint compact subsets Ki of K so that the 1-form is zero off 
the Ki and constant on each Ki. The set of rank one 1-forms in V is a dense subset 
in the (V, B {V) , fijtopology of the set of rank one 1-forms in {V, B (V) , /u). 

6. PlECEWISE LINEAR PATHS WITH NO REPEATED EDGES 

We call a path 7 piecewise linear if it is continuous, and if there is a finite 
partition 

^ta <ti <t2 < .. . <tr =T 
such that 7 is linear (or more generally, geodesic) on each segment [ti,ti+i] . 

Definition 6.1. We say the path is non-degenerate if we can choose the partition 

are not collinear for any < i < r and if the 



so that 



and 



are non-zero for every < i < r. 



The positive length condition is automatic if the path is parameterised at unit 
speed and < T. If 9i is the angle ^Ja^iltilti+i^ then 7 is non-degenerate if we 
can find a partition so that for each < z < r one has 

\e,\ ^ OmodTT. 
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This partition is unique, and we refer to the 7t._j,7f. as the i-ih hnear segment 
in 7. We see, from the quantitative estimate in part 1 of Lemma 13.71 that if we 
choose 6 — imin|6'j| and scale it so that the length of the minimal segment is 

at least K [6) — log ^ i^cos\9\ ) ' then its development into hyperbolic space is non- 



trivial and so its signature is not zero. That is to say. Lemma l3 . 71 contains all the 
information you need to give a quantitative form of Chen's uniqueness result in the 
context of piecewise linear paths: 

Theorem 9. 7/7 is a non-degenerate piecewise linear path, 29 is the smallest angle 
between adjacent edges, and D > is the length of the shortest edge then there is 
at least one n for which 



1 - cos \0\ 

■ 0<Mi<...<ti„<T 

and in particular 7 has non-trivial signature. 



dim 



Proof. Choose a — K{9) /D. Isometrically embed V into SO (Id) and let Fq, be 
the development of aj. Then Fq^jo is a piecewise geodesic path in hyperbolic space 
satisfying the hypotheses in Lemma 13.71 Thus we can deduce that the distance 
d(o, Fqo) is at least K (9) > 0. As in the discussion before Theorem U] in Section 
3.4 we have 



< 



< 



ri=0 

y 1 

Z-^ r?,l 



dim ■ ■ ■ dluf 



0<«i<...<M„<T 

K{e) 



D 



dim ---dlr, 



0<«i<...<«„<T 



Now multiplying both side by e ~ we have 

K(e) ^ 1 fK{9) 

e o e^ > < e ° > — : 



D 



dim ■ ■ ■ dlu. 



0<ui<...<u„<T 

Since any integrable function has at least one point where its value equals or exceeds 
its average and since 

Kjo) ^ 1 fK{9)' " 



1 = e- 



n=0 



D 



we can conclude an absolute lower bound on the norm of the signature, against 
the Poisson measure and conclude that there is an n for which 



0<Mi<...<ti„<T 

Recalling the form of K (9) we have the result 



dim ■ ■ ■ din, 



□ 



Corollary 6.2. Any piecewise linear path 7 that has trivial signature is tree-like 
with a height function h having the same total variation as 7. 
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Proof. We will proceed by induction on the number r of edges in the minimal 
partition 

^to <ti <t2 < ... <tr 

of 7. We assume that 7 is linear on each segment [ti,ti+i] and that 7 is always 
parameterised at unit speed. 

We assume that 7 has trivial signature. Our goal is to find a continuous real 
valued function h with h > 0, h{0) = h (T) — 0, and so that for every s, t E [0, T] 
one has 

\h{s)-h{t)\ < \t~s\ 

\ls-lt\ < h{s) + hit)-2 M hiu). 

ue[s,t] 

If r = the result is obvious; in this case T — and the function h — Q does the 
job. 

Now suppose that the minimal partition into linear pieces has r > pieces. 
By Corollary [21 it must be a degenerate partition. In other words one of the 
0^ = ^7t._i7t.7t,+i must have 

\e,\ = OmodTT. 

If 9i — IT the point ti could be dropped from the partition and the path would still 
be linear. As we have chosen the partition to be minimal this case cannot occur 
and we conclude that 9i = and the path retraces its trajectory for an interval of 
length 

s = mm{\ti - ti-i\ , \ti+i -ti\) > 0. 

Now 7 (ti — m) =7 {ti + u) for u G [0, s] and either U ~ s — ti^i 01 ti + s — t^+i. 
Suppose that the former holds. Consider the path segments obtained by restricting 
the path to the disjoint intervals 

7- = 7l[o,t,_i] 

7+ = l\[U+s,T] 
T = 7l[ti-s,ti+s], 

then 7 = 7_ * r * 7^ where * denotes concatenation. 

Because the signature map 7 — > 5* (7) is a homomorphism one sees that the 
product of the signatures associated to the segments is the signature of the con- 
catenation of the paths and hence is trivial, 

5(7_)®5(r)®5(7+) = ^(7) 

= i®oeoe...eT(y). 

On the other hand the path r is a linear trajectory followed by its reverse and as 
reversal produces the inverse signature 

5 (r) = 1 ® ® ® . . . e T (T/) . 

Thus 

5* (7_) ® s (7+) = i®o®o®...er(F) 

and so the concatenation of 7_ and 7_|_ (7 with r excised) also has a trivial signature. 
As it is piecewise linear with at least one less edge we may apply the induction 
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hypothesis to conclude that this reduced path is tree-like. Let h be the height 
function for the reduced path. Then define 



It is easy to check that /i is a height function for 7 with the required properties. □ 

The reader should note that the main result of the paper Theorem [1] linking the 
signature to tree-like equivalence relies on Chen's result only through the above 
Corollary and hence only requires a version for piecewise linear paths with no 
repeated edges. Our quantitative Theorem [9] provides an independent proof of this 
result but, in this context, is stronger than is necessary; Chen's non-quantitative 
result could equally well have been used. 

We end this section with two straightforward results which will establish half of 
our main theorem. 

Lemma 6.3. If j is a Lipschitz tree-like path with height function h, then one 
can find piecewise linear Lipschitz tree-like paths converging in total variation to a 
re-parameterisation of ^ . 

Proof. Without loss of generality we may re-parametrise time to be the arc length 
of h. Since h is of bounded variation, using the area formula (jS.ip . we can find 
finitely many points u„ within 5 of one another and increasing in [0, T] so that h 
takes the value h (un) only finitely many times and only at the times Consider 
the path 7^ that is linear on the intervals (u„, and agrees with 7 at the times 

Un- Define /i„ similarly. Then /i„ is a height function for 7„ and so 7„ is a tree. 
The paths 7„ converge to 7 uniformly, and in p- variation for all p > 1. However, 
as we have parameterised h by arc length, it follows that the total variation of 7 
is absolutely continuous with respect to arc length. As 7„ is a martingale with 
respect to the filtration determined by the successive time partitions, applying the 
martingale convergence theorem, it follows that 7„ converges to 7 in Li. □ 

Corollary 6.4. Any Lipschitz tree-like path has all iterated integrals equal to zero. 

Proof. For piecewise linear tree-like paths it is obvious by induction on the number 
of segments that all the iterated integrals are 0. Since the process of taking iterated 
integrals is continuous in p- variation norm for p < 2, and Lemma 16.31 proves that 
any Lipschitz tree-like path can be approximated by piecewise linear tree-like paths 
in 1-variation the result follows. □ 

In the next section we introduce the concept of a weakly piecewise linear path. 
After reading the definition, the reader should satisfy themselves that the arguments 
of this section apply equally to weakly piecewise linear paths. 



Paths that lie in lines are special. 

Definition 7.1. A continuous path 7^ is weakly linear (geodesic) on [0,T] if there 
is a line I (or geodesic I) so that 7^ £ I for all t e [0, T]. 



h{u) 
h{u) 
h{u) 



h{u) ,u e [0,ti_i] 

h{u-2.s) ,u e [t, + s,T] 

s — \ti — u\ -\- h (t,;-i) ,u £ [ti — s,ti -\- s] . 



7. Weakly piecewise linear paths 



Suppose that 7 is smooth enough that one can form its iterated integrals. 
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Lemma 7.2. If j is weakly linear, then the n-signature of the path 7(i)tg[oT] 



E 



(7t - 7o) 



(gin 



n! 

In particular the signature of a weakly linear path is trivial if and only if the path 
has = 7o or, equivalently, that it is a loop. 

Lemma 7.3. A weakly geodesic, and in particular a weakly linear, path with 7o = 

is always tree-like. 

Proof. By definition, 7 lies in a single geodesic. Define h{t) = d (7q, 7^) . Clearly 

/i (0) = h (T) = 
h > 0. 

If h{u) = at some point u G (s, t) then 

rf(7s,7t) < c?(7o,7,s) + c?(7o>7t) 

= h{s) + h{t)-2 inf h{u) 
«e[s,t] 

while if h{u) > at all points u € (s, t) then 7^ and 7^ are both on the same side 
of 7q in the geodesic. Assume that d (79, 7^) > c? (73, 7^), then 

dils^lt) = f^(7o>7s) -c?(7o>7t) 
= h{s)-h (t) 

< h{s) + h{t)-2 inf h (u) . 

ue[s,t] 

as required. □ 

There are two key operations, splicing and excising, which preserve the triviality 
of the signature and (because we will prove it is the same thing) the tree-like 
property. However, the fact that excision of tree-like pieces preserves the tree-like 
property will be a consequence of our work. 

Definition 7.4. If 7 e F is a path taking [0, T] to the vector space V, t £ [0, T] 
and r is a second path in V, then the insertion of r into 7 at the time point t is 
the concatenation of paths 

7l[0,t] *T * 7l[t,T]- 

Definition 7.5. If 7 € F is a path on [0,T] ,with values in a vector space V, and 
[s,t] C [0,T], then 7 with the segment [s,t] excised is 

7l[o,s] * 7l[*,T]- 

Remark 7.6. Note that these definitions make sense for paths in manifolds as well 
as in the linear case, but in this case concatenation requires the first path to finish 
where the second starts. We will use these operations for paths on manifolds, but it 
will always be a requirement for insertion that r is a loop based at 7^, for excision 
we require that ^\[s,t] is a loop. 

We have the following two easy lemmas: 

Lemma 7.7. Suppose that 'y G M is a tree-like path in a manifold M, and that r 

is a tree-like path in M that starts at 7^, then the insertion of t into 7 at the point 
t is also tree-like. Moreover, the insertion at the time point t of any height function 
for T into any height function coding 7 is a height function for 7|[o,t] *t* 'y\[t,T] ■ 
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Proof. Assume 7 G M is a tree-like path on a domain [0,T], by definition there is 
a positive and continuous function h so that for every s, s in the domain [0, T] 

rf(7.,7i) < h{s) + h{s)-2 inf h{u), 

h{0) = h (T) = 0. 
In a similar way, let the domain of r be [0, R] and let g be the height function for r 
diTs,Ts) < g{s)+g{s)-2 inf 

u^[s,s] 

9(0) = giR) = 0. 

Now insert g in /i at t and r in 7 at t. Let h, 7 be the resulting functions defined 
on [0,T + R]. Then 

7(s) = 7(s), 0<s<t 
7 (s) = r (s - , t<s<t + R 

7 (s) = 7 (s - iJ) , t + R<s<T + R, 

and 

h{s) = h{s), 0<s<t 
h{s)=g{s-t), t<s<t + R 

h{s) = h{s-R), t + R<s<T + R, 

where the definition of these functions foi s G [t + R,T + R] uses the fact that r 

and g are both loops. 

Now it is quite obvious that if s, s G [0, T + i?] \ [t, t + R], then 

d{l,,ls) < h{s) + h{~s)-2 inf h{u) 

ue[s,S]\[t,t+R] 

< h{s) + h{s)-2 inf h{u) 

h{0) = h{T + R) = 
and that for s,s G [t,t + R], 

< g{s-t)+g{s-t)-2 inf g{u) 

uE.[s—t,s—t\ 

= h{s) + h{s)-2 inf h{u). 

uG[s,s] 

To finish the proof we must consider the case where 0<s<t<s<t + R and the 
case where 0<t<s<t + R<s<T + R. As both cases are essentially identical 
we only deal with the first. In this case 

diJs^s) = rf(7,s>^.5-() 

< d{ls,lt) + diTo,Ts-t) 

< h{s) + h{t)-2 inf h{u) + g {s - t) - g (0) 
= h{s) + h{s)-2 inf h{u) 

u£[s,i] 



< h{s) + h{s)-2 inf h (u) . 



□ 
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Remark 7.8. The argument above is straightforward and could have been left to 
the reader. However, we draw attention to the converse result, which also seems 
very reasonable: that a tree-like path with a tree-like piece excised is still tree-like. 
This result seems very much more difficult to prove. The point is that the height 
function one has initially, as a consequence of 7 being tree-like, may well not certify 
that T is tree- like even though there is a second height function defined on [s, t] that 
certifies that it is. A direct proof that there is a new height function simultaneously 
attesting to the tree-like nature of 7 and r seems difficult. Using the full power of 
the results in the paper, we can do this for paths of bounded variation. 

Lemma 7.9. Let be j a path defined on [0, T] with values in V and suppose that 
l\[s.t] has trivial signature where [s,t] C [0,T]. Then 7 has trivial signature if and 
only if ^ with the segment [s,t] excised has trivial signature. 

Proof. This is also easy. Since the signature map is a homomorphism we see that 
7 = 7I [o,s] *7l [s,t] *7l [t,T] 
^(7) = ^(7l[o..]) ® ^(7l[s,t]) ®^(7l[t,T]) 
and by hypothesis S {^\[s^t]) is the identity in the tensor algebra. Therefore 
S{l) - 5(7|[o.])®^(7l[t.T]) 
= 'S'(7l[o.s] *7l[t,T]) • 

□ 

Definition 7.10. A continuous path 7, defined on [0,T] is weakly piecewise linear 
(or more generally, weakly geodesic) if there are finitely many times 

Q = ta<ti<t2< ...<tr=T 

such that for each < i < the path segment 7[t; ^ (.] is weakly linear (geodesic)Q 

Our goal in this section is to prove, through an induction, that a weakly linear 
path with trivial signature is tree-like and construct the height function. As before, 
every such path admits a unique partition so that 

Lemma 7.11. If ^ is a weakly piecewise linear path, then there exists a unique 
partition = to < ti < t2 < . . . < tr ^ T so that the linear segments associated to 

are not collinear for any <i < r. 



7t,_i>7t, 



7t.>7t,^ 



We will henceforth only use this partition and refer to r as the number of seg- 
ments in 7. 

Definition 7.12. We say 7 is fully non-degenerate if, in addition, Jn-i 7^ Iti fo^' 
every < i < r. 

Lemma 7.13. If j is a weakly linear path with trivial signature and at least one 
segment, then there exist < i < r so that 7f = 7t. • 

Proof. The arguments in the previous section on piecewise linear paths apply 
equally to weakly piecewise linear and weakly piecewise geodesic paths. In par- 
ticular Corollary 13.71 only refers to the location of 7 at the times ti at which the 
path changes direction (by an angle different from tt). □ 



^The geodesic will always be unique since the path has unit speed and ti < t^+i so contains 
at least two distinct points. 
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Proposition 7.14. Any weakly piecewise linear path 7 with trivial signature is 
tree-like with a height function whose total variation is the same as that of 7. 

Proof. The argument is a simple induction using the lemmas above. If it has no 
segments we are clearly finished with h = 0. We now assume that any weakly 
piecewise linear path 7^''"^^ consisting of at most r — 1 segments, with trivial 
signature is tree-like with a height function whose total variation is the same as that 
of 7'^''"^^. Suppose that 7*^''^ is chosen so that it is a weakly piecewise linear path 
of r segments with trivial signature but there was no height function coding it as a 
tree-like path with total variation controlled by that of 'y^'^\ Then, by Lemma FZ.lSi 

(r) (r) 

in the standard partition there must be < i < r so that ^tiU — lu ^ ^^'^ by 
assumption ti-i < ti. In other words, the segment 7^'^^|[ti_i,ti] is a weakly linear 
segment and a loop. It therefore has trivial signature, is tree-like and the height 
function we constructed for it in the proof of Lemma [7?3l was indeed controlled by 
the variation of the loop. 

Let 7 be the result of excising the segment 7|[ti_i,ti] from 7^''). As 7^''^|[ti_i,ti] has 
trivial signature, by Lemma 7 also has trivial signature. On the other hand, 7 
is weakly piecewise linear with fewer edges than 7 (it is possible that 7 restricted 
to [ti-2,ti-i] and are collinear and so the number of edges drops by more 

than one in the canonical partition - but it will always drop!). So by induction, 
7 is tree-like and is controlled by some height function h that has total variation 
controlled by the variation of 7. 

Now insert the tree-like path 7'''^'|[ti_i,ti] into 7. By Lemma [7.71 this will be 
tree-like and the height function is simply the insertion of the height function for 
7*''^'* l[ti-i,ti] i'^^'^ that for 7 and by construction is indeed controlled by the variation 
of 7'-''^ as required. Thus we have completed our induction. □ 

8. Proof of the main theorem 

We can now combine the results of the last sections to conclude the proof of our 
main theorem and its corollaries. 

Proof of Theorem [IJ Corollary 16.41 establishes that tree-like paths have trivial sig- 
nature. 

Thus we only need to establish that if the path of bounded variation has trivial 
signature, then it is tree- like. By Lemma [5.91 we can write the path as an integral 
against a rank one 1-form. By Corollary 15.261 we can approximate any rank one 
1-form by a sequence of rank one 1-forms with the property that each 1-form is 
piecewise constant on finitely many disjoint compact sets and elsewhere. By 
integrating 7 against the sequence of 1-forms wc can construct a sequence of weakly 
piecewise linear paths approximating 7 in bounded variation. By Corollary I5.22[ 
these approximations have trivial signature. By Proposition 17. 14l this means that 
these weakly piecewise linear paths must be tree-like. Hence we have a sequence of 
tree- like paths which approximate 7. By re-parametrizing the paths at unit speed 
and using Lemma [4.21 -7 must be tree-like, completing the proof. □ 

Proof of Corollary \1.6[ Recall that we defined X ^ Y, by the relation that X then 
Y run backwards is tree-like. The transitivity is the part that is not obvious. 
However, we can now say X Y if and only if the signature of XY~^ is trivial. 
As multiplication in the tensor algebra is associative, it is now simple to check the 
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conditions for an equivalence relation. Denoting the signature of X by X etc. one 

sees that 

1. The path run backward has signature YX~^ = — XY^^ = 0. 

2. XX- 1 = by definition. 

3. If X - y and y - Z, then XY"^ ^ and YZ"^ ^ 0. Thus 

= (XY^i) (YZ^^) = X (Y-i Y) Z-i = X (0) Tr^ = XZ^^ 

and hence X ^ Z required. 

It is straightforward to see that the equivalence classes form a group. □ 

Proof of Corollary \1.7\ In order to deduce the existence and uniqueness of min- 
imisers for the length within each equivalence class we observe that; 

1. We can re-parameterise the paths to have unit speed and thereafter to be con- 
stant. Then by the compactness of the equivalence classes of paths with the same 
signature and bounded length, any sequence of paths will have a subscquential 
uniform limit with the same signature. As length is lower-semicontinuous in the 
uniform topology, the limit of a sequence of paths with length decreasing to the min- 
imum will have length less than or equal to the minimum. We have seen, through 
a subsubsequence where the height functions also converge, that it will also be in 
the same equivalence class as far as the signature is concerned, so it is a minimiser. 

2. Within the class of paths with given signature and finite length there will always 
be at least one minimal element. Let X and Y be two minimisers parameterised at 
unit speed, and let hhe a height function for XY~^ . Let the time interval on which 
h is defined be [0, T] and let r denote the time at which the switch from XloY 
occurs. The function h is monotone on [0, r] and on [t, T] for otherwise there would 
be an interval [s, t] C [0, r] with h[s) — h (t). Then the function u h (u) — h (s) 
is a height function confirming that the restriction of X to [s, t] is tree- like. Now we 
know from the associativity of the product in the tensor algebra that the signature 
is not changed by excision of a tree-like piece. Therefore, X with the interval [s,t] 
excised is in the same equivalence class as X but has strictly shorter length. Thus 
X could not have been a minimiser - as it is, we deduce the function h is strictly 
monotone. A similar argument works on [r, T] . 

Let a : [0, r] [t, T] be the unique function with 

h{t) = h{a {t)) . 

Then a is continuous decreasing and cr (0) ~ T and a (r) ~ t. Moreover, X^ — 
yr-cr(M) and so we see that (up to reparameterisations), the two paths are the same. 
Hence we have a unique minimal element! □ 

Appendix A. 

A.l. Trees and paths - background information. We have shown in this paper 
that trees have an important role as the negligible sets of control theory, quite 
analogous to the null sets of Lebesgue integration. The trees we need to consider are 
analytic objects in flavour, and not the finite combinatorial objects of undergraduate 
courses. In this appendix we collect together a few related ways of looking at them, 
and prove a basic characterisation generalising the concept of height function. 
We first recall that 

(1). Graphs {E,V) that are acyclic and connected are generally called trees. If 
such a tree is non-empty and has a distinguished vertex v it is called a rooted tree. 
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(2) . A rooted tree induces and is characterised by a partial order on V with least 
element v. The partial order is defined as foUows 

a <h if the circuit free path from the root v — > 6 goes through a. 

This order has the property that for each fixed b the set {a ^ 5} is totally ordered 
by <. 

Conversely any partial order on a finite set V with a least element v and the 
property that for each h the set {a < b} is totally ordered defines a unique rooted 
tree on V . One of the simplest ways to construct a tree is to consider a (finite) 
collection f2 of paths in a graph with all paths starting at a fixed vertex, and with 
the partial order that ^ iff is an initial segment of lj' . 

(3) . Alternatively, let [E, V) be a graph extended into a continuum by assigning a 
length to each edge. Let d (a, b) be the infimum of the lengths of pathsQ between 
the two vertices o, b in the graph. Then 17 is a geodesic metric on V . Trees are 
exactly the graphs that give rise to 0-hyperbolic metrics in the sense of Gromov (see 
for example ^4j). 

(4) . There are many ways to enumerate the edges and nodes of a finite rooted tree. 
One way is to think of a family tree recording the descendants of a single individual 
(the root). Start with the root. At the root, if all children have been visited stop, 
at any other node, if all the children have been visited, move up to the parent. If 
there are children who have not been visited, then visit the oldest unvisited child. 
At each time n the enumeration either moves up an edge or down an edge - each 
edge is visited exactly twice. Let h (n) denote the distance from the top of the 
family tree after n steps in this enumeration with the convention that h{0) = 0, 
then h is similar to the path of a random walk, moving up or down one unit at each 
step, except that it is positive and returns to zero exactly as many times as there 
are edges coming from the root. Hence h (2 \E\) = 0. 

The function h completely describes the rooted tree. The function h directly yields 
the nearest neighbour metric on the tree. If /i is a function such that h (0) = 0, 
it moves up or down one unit at each step, is positive and h(2 \E\) — 0, then d 
defined by 

d (to, n) = h (to) + h (n) — 2 inf h (u) , 

[m.n] 

is a pseudo-metric on [0,2|F|]. If we identify points in [0, 2|V^|] that are zero 
distance apart and join by edges the equivalence classes of points that are distance 
one apart, then one recovers an equivalent rooted tree. 

Put less pedantically, let the enumeration be a at step n and b at step to and 
define 

d (a, b) — h (m) + h {n) — 2 inf h (u) , 

utE [m.n] 

then it is simple to check that d is well defined and is a metric on vertices making 
the set of vertices a tree. 

Thus excursions of simple (random) walks are a convenient (and well studied) 
way to describe abstract graphical trees. This particular choice for coding a tree 
with a positive function on the interval can be extended to describe continuous 
trees. This approach was used by Le Gall f5i in his development of the Brownian 
snake associated to the measure valued Dawson- Watanabe process. 



^the sum of the lengths of the edges 
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A. 2. R-trees are coded by continuous functions. One of the early examples 
of a continuous tree is the evolution of a continuous time stochastic process, where, 
as is customary in probability, one identifies the evolution of two trajectories until 
the first time they separate. (This idea dates back at least to Kolmogorov and his 
introduction of filtrations). Another popular and equivalent approach to continuous 
trees is through R-trees (fTO| p425 and the references there). 

Interestingly, analysts and probabilists have generally rejected the abstract tree 
as too wild an object, and usually add extra structure, essentially a second topology 
or Borel structure on the tree that comes from thinking of the tree as a family of 
paths in a space which also has some topology. This approach is critical to the 
arguments used here, as we prove our tree-like paths are tree-like by approximating 
them with simpler tree-like paths. (They would never converge in the 'hyperbolic' 
metric). In contrast, group theorists and low dimensional topologists have made a 
great deal of progress by studying specific symmetry groups of these trees and do 
not seem to find their hugeness too problematic. 

Our goal in this subsection of the appendix is to prove the simple representation: 
that the general R-tree arises from identifying the contours of a continuous function 
on a locally connected and connected space. The height functions we considered 
on [0, T] are a special case. 

Definition A.l. An R-tree is a uniquely arcwise connected metric space, in which 
the arc between two points is isometric to an interval. 

Such a space is locally connected, for let be the set of points a distance at 
most \/n from x. If z £ Bx, then the arc connecting x with z is isometrically 
embedded, and hence is contained in B^. Hence Bx is the union of connected sets 
with non-empty common intersection (they contain x) and is connected. The sets 
Bx form a basis for the topology induced by the metric. Observe that if two arcs 
meet at two points, then the uniqueness assertion ensures that they coincide on the 
interval in between. 

Fix some point v as the 'root' and let x and y be two points in the R-tree. 
The arcs from x and y to v have a maximal interval in common starting at v and 
terminating at some wi, after that time they never meet again. One arc between 
them is the join of the arcs from x to vi to y (and hence it is the arc and a geodesic 
between them). Hence 

d {x, y) = d {x, v) + d (y, v) — 2d {v, vi) . 

Example A. 2. Consider the space Q, of continuous paths Xt E E where each path 
is defined on an interval [0,^ (w)) and has a left limit at [0,^ (<^))- Suppose that if 
AT G is defined on [0,^), then A|[q^) € ^ for every s less than ^. Define 

d (w, (jj') =^{uj)+^ (w') - 2 sup {t < min (^ (w) , ^ (w')) I (s) = to' (s) Vs < i} ■ 
Then (f2, d) is an i?-tree. 

We now give a way of constructing R-trees. The basic idea for this is quite easy, 
but the core of the argument lies in the detail so we proceed carefully in stages. 

Let / be a connected and locally connected topological space, and h : I ^ M. he 
a positive continuous function that attains its lower bound at a point v £ I . 

Definition A. 3. For each x E I and X < h (x) define Cx,x to be the maximal 
connected subset of {y \ h (y) > A} containing x. 
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Lemma A. 4. The sets Cx,\ exist, and are closed. Moreover, if Cx,\ H C^'.y ^ 4> 
and A < A', then 

Cx',X' C Cx.X- 

Proof. An arbitrary union of connected sets with non-empty intersection is con- 
nected, taking the union of aU connected subsets of {y \ h (y) > A} containing x 
constructs the unique maximal connected subset. Since h is continuous the closure 
Dx,\ of Cx,\ is also a subset of {y \ h (y) > A}. The closure of a connected set is 
always connected hence Dx^x is also connected. It follows from the fact that Cx,\ 
is maximal that Cx.x — Dx^\ and so is a closed set. 
If Cx,x n Cx'.x' ^ (t> and A < A', then 

X eCx,xUCx',x' (l{y\h{y)>\}, 
and since Cx^x H C'x',x' 7^ 0j the set Cx^x U Cx\x' is connected. Hence maximality 
ensures Cx^x = Cx.x U Cx',x' ^tnd hence Cx'^x' C Cx^x- D 

Corollary A. 5. Either C'x.x equals Cx>,x or it is disjoint from it. 

Proof. If they are not disjoint, then the previous Lemma can be applied twice to 
prove that Cx',x C C'x^x and Cx,x C C'x',x. □ 

Corollary A. 6. If Cx.x = Cx'.x, then Cx,x" — Cx\x" for all X" < A. 

Proof. The set Cx.x, Cx' ,x are nonempty and have nontrivial intersection. Cx.x C 
Cx X" and Cx' ^x C Cxi ^x" hence Cx^x" and Cx' ^x" have nontrivial intersection. Hence 
they are equal. □ 

Corollary A.7. y e Cx,x if and only if Cy^tiy) C Cx^x- 

Proof. Suppose that y G Cx.Xi then Cy ^y^ and Cx.x are not disjoint. It follows from 
the definition of Cx^x and y G Cx.x that h{y) > A. By Lemma lA.41 Cy h (.^) C Cx^x- 
Suppose that Cyj^^y-j C Cx,x, since y G Cy ^^y^ it is obvious that y G Cx,x- D 

Definition A. 8. The set Cx '■= Cx.h(x) is commonly referred to as the contour of 
h through x. 

The map x Cx induces a partial order on / with x ^ y if Cx ^ Cy. If h 
attains its lower bound at x, then Cx = I since {y \ h (y) > h (x)} = I and / is 
connected by hypothesis. Hence the root v <y for all y € I. 

Lemma A. 9. Suppose that A G [h{v) ,/i(a;)], then there is a y in Cx,x such that 
h{y) — \ and, in particular, there is always a contour (Cx^x) at height A through y 
that contains x. 

Proof. By the definition of Cx,x it is the maximal connected subset of ft > A 
containing x; assume the hypothesis that there is no y in Cx^x with h (y) = A so 
that it is contained in /i > A, hence Cx^x is a maximal connected subset of ft > A. 
Now ft > A is open and locally connected, hence its maximal connected subsets of 
ft > A are open and Cx,x is open. However it is also closed, which contradicts the 
connectedness of the /. Thus we have established the existence of the point y. □ 

The contour is obviously unique, although y is in general not. If we consider the 
equivalence classes x~y if x < y and y ^ x, then we see that the equivalence classes 
[y]~ of y :< x are totally ordered and in one to one correspondence with points in 
the interval [ft {v) , ft (x)]. 
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Lemma A. 10. If z € Cy.x and h{z) > X, then z is in the interior of Cy^x. If 
Gx'.x' C Cx^x with A' > A, then C^.x is a neighbourhood of Cx\x' ■ 

Proof. I is locally connected, and h is continuous, hence there is a connected 
neighbourhood U of z such that h{z) > A. By maximality U C Cz.x- Since 
Cz,A n Cy^x ^ 4> we have Cz.x — Cy^x and thus U C Cy^x- Hence Cy^x is a neigh- 
bourhood of z. The last part follows trivially once by noting that for all z e C^^x' 
we have /i (z) > A' > A and hence Cy,x is a neighbourhood of z. □ 

We now define a pseudo-metric on /. Lemma lA.lOl (the only place we will use 
local connectedness) is critical to showing that the map from I to the resulting 
quotient space is continuous. 

Definition A. 11. If y and z are points in /, define X{y,z) < min {h {y) , h {z)) 
such that Cy^x = Cz.x 

X{y,z)^snp{X\Cy,x = Cz,x, X<h{y), X<h{z)}. 

The set 

{X\Cy^x^Cz,x, X<h{y), X<hiz)} 
is a non-empty interval [h (v) , A (y, z)] or [h (v) , A (y, z)) where A (y, z) satisfies 

h{v) <X{y,z) < mm {h{y),h{z)). 
Clearly A (a;, x) — h [x) . 

Lemma A. 12. The function X is lower semi- continuous 

liminf A (j/, z) > A (y, zq) . 

Proof. Fix y, zq and choose some A' < A(y, zq). By the definition of A(y, zq) we 
have that Cy x' — Cz^.x'- Since h{zo) > X' there is a neighbourhood U of zq so 
that U C Czo.x'- For any z ^ U one has z e Cz.x' H Cz^.x'- Hence Cza,x' ~ Cz,x' 
and Cy^x' = Cz,a' • Thus A (y, z) > A' for z £U and hence 

liminf A (y, z) > A'. 

Since A' < A (y, zq) was arbitrary 

liminf A (y, z) > A(y, zo) 

and the result is proved. □ 

Lemma A. 13. The following inequality holds 

min {A {x, z) , A (y, z)} < A {x, y) . 

Proof. If min {A (x, z) , A (y, z)} = /i (i;), then there is nothing to prove. Recall that 

{X\Cy,x^Cz,x, X<h{y), X<h{x)} 

is connected and contains h (v). Suppose h (v) < X < min {A {x, z) , A (y, z)}, then 
it follows that the identity Cx,x = Cz.x holds for A. Similarly Cy^x = Cz,x- As a 
result Cx.x — Cy^x and A {x,y) > A. □ 

Definition A. 14. Define c? on / x / by 

d {x, y) ^ h{x) + h (y) - 2A {x, y) . 
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Lemma A. 15. The function d is a pseudo-metric on I. If [jjdj is the resulting 

quotient metric space, then the projection I ^ I from the topological space I to the 
metric space is continuous. 

Proof. Clearly d is positive, symmetric and we have remarked that for all x, A {x, x) = 
h {x) hence it is zero on the diagonal. To see the triangle inequality, assume 

A {x, z) = min {A {x, z) , A (y, z)} 

and then observe 

d {x, y)=h{x)+h (y) - 2A {x, y) 

< h{x) + h{y)-- 2A (x, z) 

= h{x) + h (z) - 2A {x, z) + h{y)-h {z) 

<d{x,z) + \h{y)-h{z)\ 

but A (y, z) < min {h {y) , h (z)) and hence 

\h (y) -h{z)\^hiy) + h{z)-2 min {h (y) , h (z)) 
<h(y) + h (z) - 2A {y, z) 
= d{y,z) 

hence 

d{x,y) < d{x,z) + d{y,z) 
as required. □ 

We can now introduce the equivalence relation x~y\id {x, y) = and the quotient 
space //". We write I/~ = I and i : 7 — > 7 for the canonical projection. The 
function d projects onto I x I and is a metric there. 

It is tempting to think that x~y if and only if Cx = Cy and this is true if I is 
compact Hausdorff. However the definitions imply a slightly different criteria: x~y 
iff 

h {x) = h [y) = A and Cx.y = Cy^y for all A" < A. 

The stronger statement x~y \i and only if Cx = Cy is not true for all continuous 
functions h on as it is easy to find a decreasing family of closed connected sets 
there whose limit is a closcid set that is not connected. 

Consider again the new metric space I that has as its points the equivalence 
classes of points indistinguishable under d. We now prove that the projection i 
taking I to I is continuous. Fix y E I and e > 0. Since A {y, .) is lower semi- 
continuous and h is (upper semi)continuous there is a neighbourhood of y so that 
for z gU one has A {y, z) > X {y, y) — e/4 and h{z) < h (y) + e/2. Thus d {y, z) < s 
for z G U. Hence d {i {y) , i (z)) < e if z G U. The function i is continuous and as 
continuous images of compact sets are compact we have the following. 

CoroUciry A. 16. 7/7 is compact, then I is a compact metric space. 

To complete this section we will show 7 is a uniquely arcwise connected metric 
space, in which the arc between two points is isometric to an interval and give a 
characterisation of compact trees. 
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Proposition A. 17. // / is a connected and locally connected topological space, 
and h : I M is a positive continuous function that attains its lower bound, 
then its "contour tree" the metric space (^I^dj is an H-tree. Every R -tree can be 
constructed in this way. 

Proof. It is enough to prove that the metric space / we have constructed is really 
an R-tree and that every R-tree can be constructed in this way. Let x any point 
in / and x G / satisfy i (x) = x. Then h (x) does not depend on the choice of 
X. Fix h{v) < X < h{x). We have seen that there is a y such that h{y)—X and 
y < X moreover any two choices have the same contour through them and hence 
the same y (A). In this way we see that there is a map from [h {v) , h {x)] into / 
that is injective. Moreover, it is immediate from the definition of d that it is an 
isometry and that / is uniquely arc connected. 

Suppose that is an R-tree, then we may fix a base point, and for each point 
in the tree consider the distance from V it is clear that this continuous function is 
just appropriate to ensure that the contour tree is the original tree. □ 

Remark A. 18. 1. In the case where / is compact, obviously / is both complete and 
totally bounded as it is compact. 

2. An i?-tree is a metric space; it is therefore possible to complete it. Indeed the 
completion consists of those paths, all of whose initial segments are in the trecH; 
we have not identified a simple sufficient condition on the continuous function and 
topological space Q. to ensure this. An i?-tree is totally bounded if it is bounded 
and for each e > there is an N so that for each t the paths that extend a distance 
t from the root have at most N ancestral paths between them at time t — e. In 
this way we see that the i?-tree that comes out of studying the historical process 
for the Fleming- Viot or the Dawson Watanabe measure-valued processes is, with 
probability one, a compact i?-tree for each finite time. 

Lemma A. 19. Given a compact R-tree, there is always a height function on a 
closed interval that yields the same tree as its quotient. 

Proof. As the tree is compact, path connected and locally path connected, there is 
always as based loop mapping [0, 1] onto the tree. Let h denote the distance from 
the root. Its pullback onto the interval [0, 1] is a height function and the natural 
quotient is the original tree. In this way we see that there is always a version of Le 
Gall's snake "F traversing a compact tree. □ 
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